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Introduction 

Background:  Deregulation  of  the  cellular  myc  proto-oncogene  is  one  of  the  strongest 
activators  of  tumorigenesis  and  understanding  the  target  genes  and  pathways  regulated  by  this 
transcription  factor  in  cancer  etiology  will  clearly  mark  a  key  advance.  Myc  expression  and 
activity  are  highly  restricted  in  normal  human  mammary  epithelial  cells  (HMECs),  but 
unleashed  and  deregulated  in  cells  of  malignant  transformations.  Because  the  protein  product  is 
identical  in  normal  and  tumor  cells,  three  major  issues  arise.  First,  a  definitive  approach  to 
detect  oncogenic  Myc  in  primary  tumor  specimens  is  severely  lacking  and  long  overdue.  This 
issue  has  plagued  the  field  during  the  two  decades  since  Myc  was  first  discovered.  Second,  it 
remains  unclear  whether  Myc  function  is  different  in  normal  and  tumor  cells.  Myc  may  regulate 
the  same  subset  of  target  genes  in  both  settings,  but  in  a  more  robust  manner  in  tumor  cells.  By 
contrast,  deregulated,  overexpressed  Myc  protein  may  bind  and  regulate  an  additional  unique 
set  of  target  genes  in  tumor  cells.  This  issue  has  not  yet  been  explored.  Third,  although  it  is 
clear  that  inhibiting  Myc  can  trigger  tumor  regression  and  eradication  in  animal  models,  few 
initiatives  are  underway  to  target  Myc  as  a  therapeutic  approach  for  human  disease.  Because 
Myc  protein  in  normal  and  tumor  cells  is  indistinguishable,  it  is  thought  that  anti-Myc  inhibitors 
would  have  little  to  no  tumor  specificity  or  therapeutic  index.  Clearly  a  novel  approach  is 
required. 

Hypothesis:  Our  ‘idea’  is  that  in  addition  to  the  target  genes  regulated  by  Myc  in  non- 
transformed  cells,  constitutively  activated  and  overexpressed  Myc  protein  in  tumor  cells  will 
directly  bind  and  regulate  a  unique  set  of  target  genes  that  directly  contribute  to  the  carcinogenic 
process.  For  example,  at  high  levels  of  expression,  Myc  may  bind  low  affinity  sites  and  regulate 
a  distinct  cohort  of  targets  by  a  unique  mechanism  of  action.  By  identifying  this  transformation 
specific  subset  of  Myc  target  genes  we  aim  to  develop  a  diagnostic  tool  to  identify  oncogenic 
Myc  activity  in  breast  tumor  cells.  We  also  aim  to  develop  a  unique  anti-cancer  therapeutic  that 
will  potentially  target  this  unique  transforming  activity  of  Myc.  The  TRRAP  cofactor  has  been 
shown  to  be  essential  for  Myc  to  drive  transformation.  This  suggests  blocking  Myc:TRRAP 
interaction  will  inhibit  the  carcinogenic  program  directed  by  oncogenic  Myc.  By  conducting  the 
experiments  outlined  in  this  proposal  we  will  test  a  unique  hypothesis  and  will  make  significant 
contributions  to  the  molecular  diagnosis  and  treatment  of  breast  cancer  that  can  be  applied  to  the 
clinic  in  a  timely  manner 

Specific  Aims: 

1)  Identify  tumor-specific,  directly-regulated  Myc  target  genes  in  transformed  HMECs  and 
develop  a  definitive  diagnostic  tool  to  detect  oncogenic  Myc  activity  in  breast  cancer. 

2)  Isolate  small  molecular  weight  inhibitors  that  can  disrupt  Myc:TRRAP  interaction  in 

vivo  and  identify  Myc:TRRAP  co-bound  target  genes  in  breast  cancer. 

Body 

With  support  from  the  DOD,  the  research  outlined  in  the  original  proposal  has  progressed  in 
a  steady  and  productive  manner  throughout  the  course  of  this  grant  despite  several  unexpected 
technical  problems  along  the  way.  To  delineate  the  accomplishments  to  date,  the  tasks  outlined 
in  the  original  Statement  of  Work  of  the  proposal  are  itemized  below  (italics)  and  a  progress 
report  for  each  task  provided. 
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Statement  of  Work 

Task  1:  Identify  tumor-specific,  directly-regulated  Myc  target  genes  in  transformed  HMECs 
( months  1-36) 

a.  Directly  compare  fresh  and  formalin-fixed  tissue  for  efficacy  of  Chip-on-chip  assay 
using  samples,  including  archived  samples,  prepared  by  the  Andrulis  group  (months  1- 
6) 

Completed  in  year  1 :  We  anticipated  that  archived  formalin- fixed  tissue  could  be  used 
for  ChIP-on-chip  analysis,  however,  over  the  course  of  the  first  year,  we  learned  that 
formalin  fixation  conducted  in  pathology  labs  is  not  similar  to  the  formaldehyde 
fixation  used  in  our  research  lab  as  an  early  step  of  the  ChIP-on-chip  procedure.  We 
suspect  the  issue  has  more  to  do  with  the  heterogeneity  in  the  methodology  used  by 
different  technicians/residents  over  the  years  in  various  pathology  labs  (formalin 
formulation,  time  of  fixation,  etc).  To  overcome  this  problem,  we  evaluated  whether 
frozen  tissue  could  be  used  as  an  alternate  source.  We  are  pleased  to  report  that  we  can 
achieve  robust  ChIP-on-chip  results  using  0.03g  of  fresh,  frozen  tissue.  Importantly,  we 
can  move  forward  with  the  study,  using  frozen  instead  of  formalin- fixed  primary  tissue 
for  our  ChIP-on-chip  analysis. 

b.  Evaluate  minimum  numbers  of  cells  that  is  required  for  quality  assured  Chip-on-chip  of 
tissue  sections,  test  reproducibility  (months  6-12) 

Completed  in  years  1  to  3:  When  we  first  launched  this  research  we  had  several 
parameters  of  the  ChIP-on-chip  assay  to  optimize  to  ensure  we  could  achieve  the  high 
sensitivity  and  specificity  required  to  apply  this  novel  technology  to  breast  cancer  cell 
lines  and  primary  patient  specimens.  In  year  1  we  compared  and  modified  two 
amplification  methods  to  determine  how  few  cells  are  required  to  achieve  efficient 
ChIP-on-chip  results  without  introducing  extreme  bias  due  to  amplification.  In  year  2, 
several  additional  parameters  were  evaluated  and  conditions  determined  to  ensure 
outstanding  reproducibility  of  this  new  technology.  This  data  was  well-received  when 
it  was  presented  in  2006  at  the  AACR  Annual  Meeting  and  a  Cold  Spring  Harbor 
Systems  Biology  Meeting. 

In  year  3,  our  goal  was  to  apply  our  optimized  ChIP-on-chip  assay  to  map  the  genomic 
localization  of  Myc-binding  using  frozen  breast  tissue.  However,  the  archived  tissue 
from  the  ANN  tissue  bank  of  our  collaborator  (Andrulis)  showed  enormous  variability, 
that  we  assume  was  due  to  the  lengthy  storage  times  of  these  samples.  These  variability 
issues  were  not  evident  in  fresh,  frozen  tissue  used  as  a  positive  control.  An  alternative 
source  of  ANN  was  not  readily  available,  so  we  decided  to  evaluate  our  optimized 
ChIP-on-chip  protocol  on  breast  cancer  cell  lines.  However  we  were  not  able  to 
achieve  the  sensitivity  and  specificity  required.  Even  by  increasing  the  number  of  these 
adherent  breast  cells  to  very  high  levels  (60  million  cells  per  ChIP  reaction),  the  signal 
to  noise  ratio  was  not  sufficiently  robust  to  move  forward  from  ChIP  to  the  ChIP-on- 
chip  experiments.  We  have  overcome  this  issue  very  recently  by  changing  from  a 
probe  sonicator  approach  to  the  far  more  efficient  Bioruptor  sonication  system.  We 
were  only  able  to  purchase  the  Bioruptor  Instrument  about  a  month  ago  after  applying 
and  securing  funding  from  our  Institutional  Capital  Equipment  Competition  at  the 
beginning  of  2007.  Because  of  this  delay,  we  moved  the  experiments  forward  and 
completed  our  study  to  test  our  optimized  parameters  for  ChIP-on-chip  using  non¬ 
adherent  HL60  cells.  These  cells  have  high  levels  of  endogenous  Myc  expression  and 
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can  be  grown  in  bulk  as  suspension  cells,  thus  ChIP-on-chip  can  be  conducted  despite 
the  poor  recovery  of  ChIP  material  using  the  old  probe  sonicator  method.  We  have 
completed  the  experiments  associated  with  this  work  and  have  recently  submitted  a 
manuscript,  entitled,  “Optimization  of  experimental  design  parameters  for  high- 
throughput  chromatin  immunoprecipitation  studies”  that  is  presently  under  review  for 
publication. 

c.  Conduct  the  Myc  specific  Cliip-on-chip  assay  in  sixplicate  with  reverse  dye  labeling  on 
one  sample  that  has  been  processed  for  lasar  capture  microdissection  for  normal  and 
tumor  isogenic  matched  sample,  in  duplicate  (months  12  -18)  Yet  to  do* 

d.  Conduct  Myc  specific  Chip-on-chip  assay  on  2  additional  genetically  similar  patient 
samples  ( months  12-18)  Yet  to  do* 

e.  Conduct  Myc  specific  Chip-on-chip  on  3  similar  samples  whose  underlying  genetic 
abnormalities  are  distinct  from  the  first  series  of  3  (months  18-24)  Yet  to  do* 

f.  Conduct  data  analysis  to  evaluate  how  target  genes  compare  between  isogenic  normal 
and  tumor  matched  material  and  between  groups  that  harbor  distinct  genetic 
abnormalities  and  between  all  samples  to  identify  a  cohort  commonly  bound  and 
regulated  by  Myc  in  tumor  but  not  normal  tissue.  ( months  24  -32)  Yet  to  do* 

g.  Further  evaluate  the  diagnostic  potential  of  this  common  cohort  and  evaluate  whether 
cDNA  expression  profiling  shows  these  targets  are  similarly  regulated  in  ANN  tumors 
of  a  specific  subtype  or  genetic  background  (months  24  -32)  Yet  to  do* 

h.  Extend  analysis  to  tissue  arrays  using  in  situ  hybridization  or  IHC  (months  24-36)  Yet 
to  do* 

i.  Evaluate  cDNA  expression  array  data  for  genes  identified  as  regulated  by  Myc  in  the 
MCF10A  system  (months  12-36)  Yet  to  do* 

*Due  to  unexpected  difficulties  of  working  with  archived  formalin,  and  then  frozen  breast 
specimens,  along  with  the  added  complexity  of  breast  cells  and  the  need  for  the  Bioruptor 
system,  we  were  not  able  to  conduct  the  ChIP-on-chip  experiments  proposed.  However,  we  are 
dedicated  to  the  breast  cancer  question  and  these  optimized  methods  will  now  be  applied  to 
breast  cancer.  Indeed,  recent  ChIP  results  show  that  with  the  Bioruptor  we  can  achieve  an 
excellent  signal  to  noise  ratio  with  only  10  million  cells.  A  further  reduction  in  cell  number  is 
likely  achievable  and  is  presently  under  evaluation.  With  these  recent  technical  advances,  we  are 
now  in  a  strong  position  to  conduct  Myc-specific  ChIP-on-chip  on  both  breast  cancer  cell  lines 
and  primary  fresh,  frozen  biopsies.  We  have  recently  established  collaborations  with  Dr.  Susan 
Done  who  has  established  a  specific  breast  tumour  bank  at  our  Institute  in  recent  years,  that 
specializes  in  the  preservation  and  analysis  of  small  primary  breast  tumours  (<2cm)  as  well  as 
charting  patient  treatment  and  outcome.  We  look  forward  to  working  with  Susan  and  her  new 
state-of-the-art  facility.  We  will  analyze  these  tumours  for  elevated  Myc  expression  (using 
affymetrix  expression  arrays)  and  then  further  evaluate  whether  elevated  expression  is  due  to 
amplification  of  the  c-myc  locus  using  the  Taqman  and  FISH  assays  that  we  have  developed  for 
this  purpose.  As  proposed  we  will  then  conduct  the  ChIP-on-chip  experiments,  now  that  we  have 
the  protocol  optimized  for  this  purpose.  It  is  because  of  this  DOD  funding  that  we  could  move 
this  technology  forward  for  the  analysis  of  patient  specimens.  These  advances  will  enable  our 
own  research  in  this  area,  but  will  also  be  used  by  others  whose  goals  are  to  use  ChIP-on-chip  in 
their  breast  cancer  research.  We  will  acknowledge  the  support  of  the  U.S.  DOD  when  these 
studies  are  published. 
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Task  2:  Establish  MCF10A  cell  system  and  identify  tumor-specific,  directly-regulated  Myc 
target  genes  (months  1-36) 

a.  Introduce  ectopic  Myc  expression  in  the  MCF10A  cells  and  evaluate  biological  effect  at 
the  level  of  proliferation,  apoptosis  induction  in  standard  culture  conditions  (months  1- 
6)  Completed  in  year  1 .  We  have  ectopically  expressed  Myc  in  the  MCF10 A  cells  and 
have  characterized  cell  growth  and  death,  as  proposed.  Myc  potentiates  both  cell 
proliferation  and  apoptosis.  This  was  shown  in  our  year  one  report. 

b.  Conduct  Myc  ChIP-on-chip  analysis  and  cDNA  array  analysis  to  identify  target  genes 
directly  bound  and  regulated  by  Myc  under  asynchronous  conditions  (months  1-12) 
Completed  in  years  1  and  2.  We  conducted  in  silico  analysis  on  the  expression  array 
data  from  the  ANN  samples  of  Irene  Andrulis  and  showed  that  H19  expression  was 
highly  associated  with  elevated  Myc  expression  in  these  tumours.  We  conducted  ChIP 
analysis  in  breast  cancer  cell  lines  and  identified  H19  as  a  novel  Myc  target  gene 
regulated  by  Myc  in  MCFlOAs.  We  were  able  to  conduct  ChIP  using  Myc-specific 
antibody  while  assaying  a  specific  target  gene,  such  as  HI 9,  but  the  method  was  not 
robust  enough  for  ChIP-on-chip  (see  above).  With  this  positive  control  established,  we 
went  on  to  study  the  regulation  of  H19  by  Myc  and  to  study  the  role  of  H19  in  breast 
cancer.  We  further  showed  that  Myc  induction  of  H19  plays  a  role  in  Myc  induced 
transformation  by  conducting  siRNA  knock-downs  in  breast  cancer  cell  lines  that  do 
and  do  not  express  HI 9.  Indeed,  the  loss  of  H19  inhibits  full  transformation  potential 
of  these  breast  cancer  cells.  Thus  we  show  that  we  have  identified  at  least  one  novel 
Myc-induced  gene  (H19),  using  the  breast  cancer  cell  system,  that  plays  an  important 
role  in  transformation.  Only  a  handful  of  Myc  target  genes  have  been  shown  to  play  a 
role  in  transformation,  thus  it  was  important  to  focus  on  this  single  important  target  and 
ensure  the  publication  of  this  information  in  a  timely  manner.  To  this  end,  a  manuscript 
describing  this  work  was  been  published  in  Cancer  Research  (Appendix  1).  In 
addition,  this  manuscript  was  highlighted  in  Nature  Reviews  Cancer  in  the  July  2006 
issue  (Appendix  2).  Using  the  MCF10A  system  and  our  ChIP-on-chip  analysis,  we  aim 
to  identify  similar  Myc  targets  that  are,  like  HI 9,  Myc-regulated  and  key  to  breast 
cancer  development,  as  originally  proposed.  We  and  others  are  now  targeting  H19  for 
the  development  of  anti-breast  cancer  therapeutics. 

c.  Into  the  control  and  Myc  expressing  cells  introduce  activatable  erbB2  and  p53-DD 
(months  6-12).  No  longer  necessary.  Originally  it  remained  unclear  whether 
deregulated  Myc  alone  would  be  able  to  transform  the  MCF10A  cells,  as  assessed  by 
anchorage-independent  growth  in  soft  agar.  It  was  anticipated  that  additional  genetic 
lesions,  such  as  erbB2  and/or  p53-DD  would  be  necessary  and  was  proposed. 
However,  these  additional  genetic  lesions  are  not  required.  Ectopic  Myc  alone  is  able 
to  transform  the  MCF10A  cells.  Thus  our  experimental  model  system  was  established 
in  the  absence  of  erbB2  and  p53-DD. 

d.  Assay  cells  for  growth  in  soft  agar  as  well  as  proliferation  and  apoptosis  assays 
(months  12-18)  Complete.  The  MCF10A  +/-  Myc  cells  have  been  assayed  thoroughly 
and  form  the  basis  of  our  future  work.  Myc  potentiates  proliferation,  apoptosis  and 
growth  in  soft  agar  of  these  cells.  Several  strains  of  MCF10A  were  evaluated  and  all 
behaved  similarly  with  respect  to  Myc  effects  on  cellular  growth,  death  and 
tumorigenesis.  This  is  an  enormous  break-through  as  we  can  measure  the  molecular 
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mechanism  of  Myc-induced  transformation  as  ectopic  Myc  expression  converts  these 
immortal,  non-transformed  cells  to  a  further  transformed  state  as  measured  by 
anchorage  independent  growth.  This  system  will  be  used  to  identify  the  critical 
transformation-associated  genes  regulated  by  Myc. 

e.  Assay  Myc  ChIP-on-chip  and  cDNA  arrays  on  these  cells  expressing  ectopic  Myc 
and/or  erbB2  and/or  p53-DD,  under  asynchronously  growing  conditions  ( months  18- 
32)  Yet  to  do#. 

f  Assay  more  transformed  cells  for  invasion,  polarity,  morphology  properties  ( months  18 
-32)  Yet  to  do#. 

g.  Test  for  genetic  abnormalities  and  their  effects  on  cell  differentiation  in  matrigel 
( months  24-36)  Yet  to  do#. 

h.  Assay  all  cells  for  Myc  Chip-on-chip  and  cDNA  arrays  when  grown  in  soft  agar 
(months  24-36)  Yet  to  do#. 

#We  conducted  the  ChIP-on-chip  assays  with  the  MCF10A  cells,  with  and  without  ectopic  Myc 
expression,  but  it  was  with  this  work  that  we  further  realized  the  sensitivity  and  specificity  of  the 
assay  was  insufficient.  After  conducting  10  biological  replicates  we  could  see  that  we  had 
achieved  signal  over  noise,  as  evident  by  volcano  plots,  but  the  signal  did  not  achieve  statistical 
significance,  despite  the  high  number  of  replicates.  Rather  than  continuing  to  repeat  the  assay, 
we  decided  we  had  to  trouble-shoot  the  protocol  once  again  to  determine  the  problem  and  then  to 
overcome  this  rate-limiting  step.  As  discussed  above,  this  turned  out  to  be  the  recovery  of 
specific  cross-linked  material  from  the  breast  cancer  cells  and  this  was  solved  by  moving  to  a 
more  efficient  sonication  system  (Bioruptor).  Thus,  we  have  a  robust  cell  system  and  ChIP-on- 
chip  protocol  that  are  working  well.  We  have  been  able  to  develop  the  assay  to  work  for  both 
Myc  and  TRRAP-specific  antibodies  (see  Figure  1).  We  will  acknowledge  the  DOD  for  funding 
support  when  this  work  is  complete  and  published. 

Task  3:Isolate  small  molecular  weight  inhibitors  that  can  disrupt  Myc :TRRAP  interaction  in  vivo 
and  identify  Myc.  TRRAP  co-bound  target  genes  in  breast  cancer. 

a.  Develop  and  test  antibodies  to  TRRAP  for  ChIP  (months  1-12)  Completed.  After 
much  trouble-shooting  we  are  able  to  readily  ChIP  TRRAP  from  live  cells  using  a 
homemade  antibody.  The  specificity  of  the  antibody  has  been  rigorously  evaluated  and 
the  ChIP  results  are  robust  and  quantitative.  This  has  enable  TRRAP  ChIP-on-chip  to 
be  conducted  in  the  MCF10A  cell  system  described  above.  We  aim  to  also  advance 
these  assays  to  primary  patient  material,  as  proposed.  By  this  approach  the  target  genes 
that  are  co-bound  by  Myc  and  TRRAP,  that  are  essential  for  breast  cancer 
transformation,  will  be  identified. 

b.  Develop  Myc  and  TRRAP  interacting  fragments  in  new  screening  system  and  evaluate 
interaction  and  susceptibility  to  inhibition  with  TRRAP  polypeptide  (months  1-12) 
Conducted  in  Year  2  and  3.  Before  setting  up  the  screening  system  we  needed  to  ensure 
the  fragment  of  TRRAP  shown  to  interact  with  Myc  did  indeed  interact  with  Myc  in 
vivo.  This  has  been  evaluated  and  is  submitted  for  publication  (Appendix  3).  Targeting 
Myc  as  a  novel  anti-cancer  therapeutic  has  enormous  potential  (for  more  information, 
see  our  review  Appendix  4).  With  these  results  we  went  forward  to  establish  the 
screening  system,  as  proposed,  but  the  system  did  not  work  with  our  positive  and 
negative  controls  (See  Figure  2A,  F8  and  FI  respectively).  The  RTA  was  modified, 


8 


however  after  conducting  the  screen  we  characterized  the  few  positive  hits  and  revealed 
that  they  were  all  false  positives.  This  was  disappointing  and  frustrating. 

c.  Conduct  screen  and  test  positives  in  the  RTA  against  Myc:TRRAP  as  well  as  other 
interactors  of  Myc  and  other  interactors  of  TRRAP  to  evaluate  specificity  of  the 
inhibitor  (months  12-18)  Conducted  in  year  3.  The  hits  were  shown  to  be  false 
positives. 

d.  Advance  inhibitors  to  mammalian  cell  assays,  including  growth,  transformation  DNA 
binding  using  a  Chip-on-chip  approach  (months  18-36)  Not  achievedA 

AWith  the  disappointing  results  of  our  screen,  and  in  the  interest  of  time,  we  decided  it  was  not 
worthwhile  to  further  pursue  the  RTA  as  a  screening  system.  The  rate  of  false  positives  was  too 
high  and  there  was  insufficient  time  to  re-tool  the  assay  and  conduct  the  screen  again  before  the 
grant  expired.  We  decided  to  be  productive  in  the  time  remaining  (4  months)  we  would  work 
towards  identifying  another  region  of  Myc:TRRAP  interaction  that  could  potentially  also  be 
targeted  for  anti-cancer  therapeutics.  Our  friend  and  collaborator  Michael  Cole  shared 
information  with  us  that  he  had  identified  that  amino  acids  20  to  40  of  Myc  can  also  bind  to 
TRRAP  (for  details,  see  his  recent  review  article1),  so  we  evaluated  the  region  of  TRRAP  that 
might  be  involved.  This  builds  on  our  previous  work  and  enabled  us  to  further  exploit  reagents 
we  had  already  made  in  the  lab.  Indeed,  we  have  been  able  to  identify  another  region  of 
TRRAP,  Myc  Binding  Domain  2  (MBD2),  that  interacts  with  Myc  (see  Figure  2B). 

Key  Research  Accomplishments 

•We  have  exploited  the  ANN  expression  data  and  identified  H19  as  a  bone  fide  Myc  target  gene 
that  contributes  to  transformation  of  breast  cancer.  This  has  been  published. 

•We  have  established  that  deregulated  Myc  expression  is  sufficient  to  transform  MCF10A  cells 
as  defined  by  anchorage  independent  growth  in  soft  agar.  This  system  is  ideal  to  further  identify 
the  entire  subset  of  genes  Myc  regulates  to  transform  human  mammary  epithelial  cells. 

•We  have  optimized  the  ChIP-on-chip  protocol  and  have  been  able  to  successfully  trouble-shoot 
several  issues  associated  with  ChIP-on-chip  in  breast  cell  and  primary  patient  tissue. 

•Myc  and  TRRAP  interaction  domains  have  been  thoroughly  analyzed  and  fully  characterized. 
This  has  resulted  in  a  publication  that  has  been  submitted. 

•We  have  identified  a  novel  region  of  TRRAP  (MBD2)  that  interacts  with  Myc  as  shown  by 
ectopic  expression  followed  by  co-immunoprecipitation  experiments. 

Reportable  Outcomes 

•Poster  presented  at  the  Era  of  Hope  Department  of  Defense  Breast  Cancer  Research  Program 
Meeting,  June  8-  11,  2005,  Pensylvania  Convention  Centre,  Philadelphia,  Pennsylvania 

•  Barsyte-Lovejoy  D,  Lau  SK,  Boutros  PC,  Khosravi  F,  Jurisica  I,  Andrulis  IL,  Tsao  MS,  Penn 
LZ.  The  c-Myc  oncogene  directly  induces  the  H19  noncoding  RNA  by  allele-specific  binding  to 
potentiate  tumorigenesis.  Cancer  Res.  2006  May  15;66(10):5330-7.  (see  Appendix  1) 

•Nature  Reviews  Cancer  -  The  Silent  Messenger.  July  (6)  2006:488.  Highlight  of  Cancer 
Research  article  above  (see  Appendix  2) 

•Ponzielli  R,  Katz  S,  Barsyte-Lovejoy  D,  Penn  LZ.  Cancer  therapeutics:  targeting  the  dark  side 
of  Myc.  Eur  J  Cancer.  2005  Nov;41(16):2485-501.  Review,  (see  Appendix  3) 

•  Lilia  Kaustov,  Sigal  Katz,  Cynthia  S.W.  Ho,  Romina  Ponzielli,  Shili  Duan,  Steve  McMahon, 
Michael  D.Cole,  Linda  Z.  Penn,  and  Cheryl  H.  Arrowsmith ,  Characterizing  cMyc  MBII  and  its 
interaction  with  TRRAP,  submitted  and  under  review,  (see  Appendix  4) 
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•Optimizing  ChIP-on-chip  and  reporting  on  key  parameters  of  technology  -  presented  at  AACR 
Annual  Meeting,  Washington,  DC,  April  1-5,  2006  (Penn  oral  presentation,  Boutros  poster 
presentation);  also  presented  at  Cold  Spring  Harbor  Labs,  System  Biology  Conference,  Cold 
Spring  Harbor,  NY,  Mar  23  -26,  2006  (Boutros  poster  presentation) 

•Oral  presentation  by  Penn  at  “Reasons  for  Hope”  Conference,  sponsored  by  the  Canadian 
Breast  Cancer  Research  Alliance,  Montreal,  Quebec,  May  6-8,  2006 

•Submission  of  manuscript  entitled,  “Optimization  of  experimental  design  parameters  for  high- 
throughput  chromatin  immunoprecipitation  studies”,  (see  Appendix  5) 

Conclusions 

Several  publications  have  arisen  from  this  research  and  several  additional  significant 
contributions  are  expected. 

1 .  We  established  a  useful  model  system  to  recapitulate  the  transition  from  non-transformed 
(MCF10A  cells)  to  transformed  (MCF10A  cells  with  deregulated  Myc  expression) 
human  mammary  epithelial  cells.  This  system  is  the  best  we  could  have  hoped  for.  It  is 
simple  and  does  not  involve  the  need  to  introduce  additional  genetic  abnormalities  to 
achieve  anchorage  independent  growth.  This  enables  the  genetics  to  be  readily 
interpreted.  Indeed,  this  system  enabled  the  Myc  target  gene  H19  to  be  quickly  evaluated 
for  it  mechanism  of  regulation  and  importantly,  its  role  in  breast  cancer  transformation. 

2.  We  have  developed  the  technologies  to  determine  which  tumors  have  high  Myc 
expression  due  to  gene  amplification  (Taqman,  FISH)  and  to  determine  the  genomic 
localization  of  Myc  using  a  significantly  optimized  ChIP-on-chip  protocol  for  primary 
and  established  cells  derived  from  breast  cancers. 

3.  We  have  developed  critical  antibodies  that  recognize  TRRAP  and  work  in  ChIP  to 
identify  TRRAP-bound  target  genes  in  a  sensitive  and  specific  manner.  These  valuable 
reagents  will  be  used  for  further  ChIP-on-chip  analyses,  as  proposed. 

4.  We  have  established  two  regions  of  Myc  and  TRRAP  interaction. 

Salaries  paid  from  this  grant:  Sigal  Katz,  Paul  Boutros,  Fereshteh  Khosravi,  Angelina 
Stojanova.  In  addition,  Romina  Ponzielli  and  Dalia  Barsyte-Lovejoy  worked  on  aspects  of  this 
work,  but  they  had  secured  their  own  scholarships  to  cover  their  salaries. 
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The  c-Myc  Oncogene  Directly  Induces  the  HI  9  Noncoding  RNA  by 
Allele-Specific  Binding  to  Potentiate  Tumorigenesis 

Dalia  Barsyte-Lovejoy,1  Suzanne  K.  Lau,34  Paul  C.  Boutros,1 2  4  Fereshteh  Khosravi,1  Igor  Jurisica,24  6 
Irene  L.  Andrulis,7  Ming  S.  Tsao,34  and  Linda  Z.  Penn'  4 

Divisions  of  'Cancer  Genomics  and  Proteomics,  Signaling  Biology,  and  3Applied  Molecular  Oncology,  Ontario  Cancer  Institute/Princess 
Margaret  Hospital;  Departments  of  4Medical  Biophysics  and  laboratory  Medicine  and  Pathobiology,  6Computer  Science,  University 
of  Toronto,  Canada;  and  7Fred  Litwin  Cancer  Genetics  Center,  Samuel  Lunenfeld  Research  Institute  Mount  Sinai  Hospital, 
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Abstract 

The  product  of  the  MYC  oncogene  is  widely  deregulated  in 
cancer  and  functions  as  a  regulator  of  gene  transcription. 
Despite  an  extensive  profile  of  regulated  genes,  the  transcrip¬ 
tional  targets  of  c-Myc  essential  for  transformation  remain 
unclear.  In  this  study,  we  show  that  c-Myc  significantly  induces 
the  expression  of  the  H19  noncoding  RNA  in  diverse  cell  types, 
including  breast  epithelial,  glioblastoma,  and  fibroblast  cells. 
c-Myc  binds  to  evolutionarily  conserved  E-boxes  near  the 
imprinting  control  region  to  facilitate  histone  acetylation  and 
transcriptional  initiation  of  the  H19  promoter.  In  addition, 
c-Myc  down-regulates  the  expression  of  insulin-like  growth 
factor  2  ( IGF2 ),  the  reciprocally  imprinted  gene  at  the  H19/ 
IGF2  locus.  We  show  that  c-Myc  regulates  these  two  genes 
independently  and  does  not  affect  H19  imprinting.  Indeed, 
allele-specific  chromatin  immunoprecipitation  and  expression 
analyses  indicate  that  c-Myc  binds  and  drives  the  expression  of 
only  the  maternal  H19  allele.  The  role  of  H19  in  transforma¬ 
tion  is  addressed  using  a  knockdown  approach  and  shows 
that  down-regulation  of  H19  significantly  decreases  breast  and 
lung  cancer  cell  clonogenicity  and  anchorage-independent 
growth.  In  addition,  c-Myc  and  H19  expression  shows  strong 
association  in  primary  breast  and  lung  carcinomas.  This 
work  indicates  that  c-Myc  induction  of  the  H19  gene  product 
holds  an  important  role  in  transformation.  (Cancer  Res  2006; 
66(10);  5330-7) 

Introduction 

The  transforming  members  of  the  Myc  family  (c-Myc,  N-Myc, 
and  L-Myc)  show  deregulated  expression  in  a  broad  spectrum  of 
cancers,  including  carcinomas  of  the  lung,  breast,  and  prostate  as 
well  as  leukemias  and  lymphomas  (1).  c-Myc  is  a  transcription 
factor  that,  with  its  obligate  heterodimerization  partner  Max,  binds 
to  DNA  sequence  elements  called  E-boxes  (2).  c-Myc-Max  can 
subsequently  recruit  histone  acetyltransferase  (HAT)  activity  (3), 
chromatin  remodeling  complexes  (4),  or  promote  RNA  polymerase 
II  (RNAPII)  clearance  (5)  to  allow  for  target  gene  transcription. 
c-Myc-Max  can  also  repress  gene  transcription  primarily  by 
interfering  with  the  assembly  or  function  of  the  transcriptional 


Note:  Supplementary  data  for  this  article  are  available  at  Cancer  Research  Online 
(http:/ / cancerres.aacrjournals.org/). 
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complex  (6-8).  As  a  transcription  factor,  c-Myc  regulates  numerous 
gene  targets  that  subsequently  execute  its  many  biological 
activities,  including  cell  proliferation,  transformation,  angiogenesis, 
and  apoptosis  (9).  Identifying  these  target  genes  is  key  in 
elucidating  the  role  of  this  potent  oncogene  in  transformation 
and  has  thus  received  much  attention.  Despite  an  extensive  list  of 
c-Myc-regulated  genes,  it  remains  unclear  which  cohort  of  target 
genes  is  responsible  for  the  strong  transforming  activity  of  c-Myc 
(10,  11). 

Recent  analyses  using  advanced  high-throughput  chromatin 
immunoprecipitation  technology  has  revealed  the  nature  of  the 
target  genes  whose  promoter  regulatory  regions  are  bound  by 
c-Myc  (12-15).  Remarkably,  these  analyses  have  indicated  that 
c-Myc  target  genes  include  both  coding  and  noncoding  RNAs 
(ncRNA;  ref.  16).  Noncoding  RNAs  are  transcripts  expressed  and 
processed  in  the  nucleus  in  a  manner  similar  to  protein  coding 
genes;  however,  ncRNAs  lack  a  conserved  open  reading  frame. 
Although  elucidating  the  function  of  ncRNAs  is  in  the  early  stages 
of  investigation,  evidence  suggests  that  at  least  some  may  have 
roles  in  tumorigenesis  (17).  To  determine  the  role  of  c-Myc- 
regulated  ncRNAs  in  transformation,  we  investigated  both  the 
regulation  and  function  of  the  large  prototypic  ncRNA,  H19,  as  a 
downstream  target  of  c-Myc. 

H19  was  first  described  as  a  tumor  suppressor  (18,  19),  but  more 
recent  analysis  shows  that  H19  expression  is  reactivated  in  breast 
(20),  endometrial  (21),  lung  (22),  cervical  (23),  esophageal  (24),  and 
bladder  (25)  tumors.  The  H19 /insulin-like  growth  factor  2  ( IGF2 ) 
locus,  containing  both  the  H19  and  IGF2  genes,  is  subject  to 
genomic  imprinting,  which  leads  to  differential  allelic  expression  of 
H19  from  the  maternal  allele  and  IGF2  from  the  paternal  allele  (26). 
This  allele-specific  expression  is  highly  regulated  by  differential 
methylation  of  CpG  dinucleotides  that  are  usually  concentrated  in 
CpG  islands,  genomic  elements  that  are  often  located  close  to 
promoter  regions  (27).  As  our  data  indicated  that  c-Myc  can  bind 
to  intergenic  regions  containing  CpG  islands  (13),  we  also  explored 
the  consequences  of  allele-specific  CpG  methylation  on  the 
transcriptional  regulatory  function  of  c-Myc  at  the  H19/IGF2  locus. 

In  this  study,  we  show  that  c-Myc  induces  the  expression  of  the 
H19  ncRNA  and  binds  directly  to  E-boxes  close  to  the  imprinting 
control  region  (ICR).  Using  allele-specific  chromatin  immunopre¬ 
cipitation  analysis,  we  show  that  c-Myc  specifically  binds  and 
regulates  the  active  maternal  HI  9  allele  and  does  not  bind  or  affect 
the  expression  of  the  silenced  paternal  allele.  In  addition,  c-Myc 
down-regulates  transcription  of  the  reciprocally  imprinted  gene 
IGF2.  The  significance  of  H19  up-regulation  by  c-Myc  and  the 
association  of  c-Myc  and  H19  transcript  levels  were  assessed  in 
primary  and  established  tumor  cells  derived  from  breast  and  lung 
cancer  patients. 
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Materials  and  Methods 

Cell  lines.  The  immortal,  nontransformed  MCF10A  breast  cell  line  (gift 
from  Dr.  Muthuswamy)  was  cultured  as  described  (28).  Cells  were  grown 
with  10%  fetal  bovine  serum  in  ct-MEM  (glioblastoma  T98G),  DMEM 
H21  (rat  cardiomyocyte  H9C2  and  fibroblast  RatlMycERTAM),  McCoy’s 
(breast  cancer  cell  lines  MDA-MB231,  SKBR3,  and  colon  carcinoma  line 
HCT116),  and  RPMI  1640  (T47D  breast  cancer  cell  line  and  lung  cancer 
cell  lines  A549,  H460,  and  H520).  Rat  Myc  null  cells  (H015.19)  were  grown 
in  DMEM  H21  with  10%  calf  serum.  Where  indicated,  5  pmol/L  5-aza- 
deoxycytidine  (AzaC;  Sigma,  St.  Louis,  MO)  was  added  to  cells  every 
24  hours.  Trichostatin  A  (TSA;  Calbiochem,  La  Jolla,  CA)  was  used  at 
300  nmol/L. 

Retroviral  gene  transfer.  Ectopic  human  c-Myc  was  introduced  by 
infection  with  ecotropic,  replication-incompetent  retrovirus,  and  expression 
was  confirmed  as  described  (29). 

Isolation  and  analysis  of  RNA.  Total  RNA  was  isolated  as  previously 
described  (30)  and  purified  using  the  RNeasy  Mini  kit  (Qiagen,  Inc., 
Chatsworth,  CA).  Five  micrograms  of  total  cellular  RNA  were  reverse 
transcribed  using  Superscript  II  Reverse  Transcription  reagents  and 
01igoDT  (Invitrogen,  San  Diego,  CA).  Northern  blots  were  done  as 
described  (11). 

Gene  expression  analysis  in  cancer  cell  lines.  Semiquantitative  reverse 
transcription-PCR  (RT-PCR)  was  carried  out  as  described  (13)  with  primers 
provided  in  Supplementary  Table  SI,  and  the  conditions  are  available  upon 
request.  For  the  quantitative  real-time  RT-PCR  analysis  of  the  breast  and 
lung  cell  lines,  SYBR  Green  PCR  Master  Mix  (Applied  Biosystems,  Foster 
City,  CA)  were  used  according  to  the  manufacturer’s  instructions.  A  total  of 
one  thousandth  of  the  cDNA  reaction  was  used  for  each  PCR  triplicate.  The 
results  were  normalized  to  the  levels  of  the  36B4  transcript  using  the 
comparative  Ct  method.  The  allele-specific  PCR  employed  the  forward 
primers  specific  for  the  particular  HI  9  allele,  CGGCTCTCGAAGGTGAAGCT 
(B)  or  CGGCTCTCGAAGGTGAAGCG  (A),  and  the  reverse  primer  used  was 
TCGTGGAGGCTTTGAATCTCTCAG. 

Selection  of  patients.  The  RNA  expression  profiles  of  a  total  of  186 
breast  cancer  samples,  representing  137  distinct  tumors  from  the  cohort 
described  (31),  were  assayed  using  cDNA  expression  microarrays.  Two 
samples  lacked  follow-up  data  and  were  excluded  from  further  analysis.  A 
total  of  240  snap-frozen  non-small  cell  lung  (NSCLC)  carcinoma  samples 
were  harvested  from  patients  who  have  been  treated  primarily  by  surgical 
resection  at  the  University  Health  Network  from  1996  to  2000.  Tissues  were 
banked  with  informed  consent,  and  the  studies  have  been  approved  by  the 
institutional  Research  Ethics  Board. 

Analysis  of  breast  tumor  expression  data.  Raw  microarray  images 
were  quantitated  with  the  GenePix  (Axon,  Union  City,  CA)  software 
package  with  flagging  of  low-quality  spots.  Spot  signal  was  calculated  by 
subtracting  the  median  background  pixel  intensity  from  the  mean 
foreground  pixel  intensity.  Ratios  were  transformed  into  log2  space,  and 
missing  values  were  imputed  from  spotwise  duplicates  where  possible. 
Data  were  normalized  by  sequential  print-tip  loess  smoothing  (within 
array)  and  scale  adjustment  (between  arrays)  as  described  by  Yang  et  al. 
(32).  All  normalization  employed  the  limma  package  of  the  Bioconductor 
library  for  R  (v  2.0.1;  ref.  33).  Normalized  expression  values  were  centered 
using  a  15%  trimmed  mean.  To  assess  the  spot  quality,  we  determined  the 
correlation  of  the  two  sequences  representing  H19  (BI092679  and 
BQ028553)  and  MYC  (H43827  and  W87741).  Five  outliers  with  extreme 
H19  expression  were  identified  via  the  Q  test  and  removed,  leading  to 
higher  spot  correlation  (R  =  0.72).  Spots  were  then  collapsed  by  averaging. 
Similarly,  the  expression  from  two  MYC  spots  were  found  to  be  largely 
uncorrelated  ( R  =  —0.07);  thus,  the  clone  most  specific  to  MYC  was 
selected.  These  normalized  data  represented  137  distinct  tumor  samples, 
with  clinical  follow-up  available  for  135  of  these.  These  samples  were 
dichotomized  around  the  median  H19  expression  level.  A  t  test  with  the 
assumption  of  unequal  variances  was  used  to  test  MYC  expression  for 
significant  differences.  Normalized  array  data  for  all  spots  are  available  as 
Supplementary  Table  S2. 

Quantification  and  analysis  of  MYC  and  H19  expression  in  lung 
tumor  samples.  Real-time  quantitative  PCR  amplification  was  conducted 


using  the  SYBR  Green  assay  in  the  ABI  PRISM  7900-HT  (Applied 
Biosystems).  Each  10-pX  quantitative  RT-PCR  reaction  contained  a  2-ng 
equivalent  of  cDNA  in  a  384-well  plate.  The  reactions  were  activated  at 
95°C  for  3  minutes  followed  by  40  cycles  of  95°C  for  15  seconds,  65°C  for 
15  seconds,  and  72°C  for  20  seconds.  The  transcript  number/ng  cDNA  was 
obtained  using  standard  curves  generated  with  a  pool  of  10  nontumor  lung 
genomic  DNAs.  Primer  sequences  are  provided  in  Supplementary  Table  SI. 
Duplicate  RT  samples  were  used  in  each  assay.  Technical  replicates 
displayed  high  correlation  (i?avg  =  0.96  ±  0.04;  Supplementary  Fig.  SI)  and 
were  then  collapsed  through  averaging.  First,  expression  values  were  log2 
transformed  after  addition  of  a  pseudocount.  Samples  lacking  H19  or  TBP 
signal  were  removed  from  the  data  set.  For  each  of  the  remaining  samples,  a 
normalization  factor  was  calculated  using  the  mean  of  the  four 
housekeeping  genes  (TBP,  ACTB,  B2M,  and  BAT1)  and  used  to  remove 
nonbiologicalvariability.Forthe/thpatient,thenormalizationfactoris  Nt 

Nt  =  0.25  x  [A, (TBP)  +  A* (ACTB)  +  A,(B2M)  +  A,(BAT1)],  and  normalized 
expression  values  for  gene  M  are  Yi(M)  =  A \(M)  —  Nb  where  the  Xt  values 
correspond  to  unnormalized  expression  values.  For  each  patient  cohort,  the 
normalized  expression  values  were  median  centered  to  yield  the  final 
expression  estimates:  Z^M)  =  Yt(M)  —  median [Y(M)\.  The  two  patient 
cohorts  were  then  merged,  and  the  overall  data  set  of  240  distinct  sam¬ 
ples  was  dichotomized  around  the  median  H19  expression  level.  A  t  test 
with  the  assumption  of  unequal  variances  was  used  to  test  for  differential 
c-Myc  expression.  Raw  quantitative  RT-PCR  data  are  available  online  as 
Supplementary  Table  S3. 

Chromatin  immunoprecipitation.  Chromatin  immunoprecipitation 
was  done  as  previously  described  (13)  using  the  following  antibodies:  2 
pg  Myc  (Sc-764),  1  pg  CTCF,  0.5  pg  RNAPII  (Sc-764),  1  pg  Max,  all  from 
Santa  Cruz  Biotechnology  (Santa  Cruz,  CA),  and  0.5  pg  of  AcH3  or  AcH4 
(Upstate  Biotechnology,  Lake  Placid,  NY).  Real-time  PCR  was  done  as 
described  above  using  human  genomic  DNA  as  the  standard  and 
normalizing  the  specific  antibody  signal  to  the  input  signal.  Primer 
sequences  are  provided  in  the  Supplementary  Table  SI,  and  the  conditions 
are  available  upon  request.  The  same  chromatin  immunoprecipitation 
material  was  used  for  allele  specific  PCR  with  allele-specific  primers 
CGCCTACTTATGTGATGATCAG  or  CGCCTACTTATCTGATGATCAC  and 
the  reverse  GCACCCACGATAATGGATT. 

H19  knockdown.  The  small  interfering  RNAs  (siRNA)  were  designed 
using  the  SFold  web  site  (http://sfold.wadsworth.org).  The  control  siRNA 
was  against  luciferase  (34),  whereas  H19  hairpin  oligo  sequences  are 
CCGGGCGGGTCTGTTTCTTTACTTTCAAGAGAAGTAAAGAAACAGACC- 
CGCTTTTTG  and  reverse  AATTCAAAAAGCGGGTCTGTTTCTTTACTTCT- 
CTTGAAGTAAAGAAACAGACCCGC.  The  annealed  phosphorylated  oligos 
were  cloned  into  pLKOlpuro  lentiviral  vector  (gift  from  Drs.  Stewart, 
Novina,  and  Weinberg).  These  constructs,  together  with  packaging  vectors 
pMD.G,  pMDLg/pRRE  and  pRsv-Rev  (gift  from  Dr.  Naldini),  were  trans¬ 
fected  into  293TV  cells,  and  viral  supernatant  was  collected  48  hours  later 
and  used  to  infect  the  cells. 

Anchorage  and  clonogenicity  assays.  Anchorage-independent  growth 
assays  were  done  as  described  (29),  except  5,000  cells  were  seeded  and 
counted  at  the  end  of  a  2-week  period.  For  clonogenicity  assays,  500  cells 
were  seeded  in  six-well  dishes,  and  the  media  containing  puromycin  was 
changed  every  3  days.  After  the  1-  to  2- week  period,  the  resulting  colonies 
were  stained  with  2%  methylene  blue  in  50%  ethanol  and  counted. 

Results 

c-Myc  up-regulates  H19  and  down-regulates  IGF2  tran¬ 
scripts.  The  H19/IGF2  locus  is  subject  to  genomic  imprinting 
(Fig.  L4).  Allele-specific  methylation  of  CpG  dinucleotides  in  the 
ICR  leads  to  H19  expression  from  the  maternal  allele,  whereas  the 
reciprocally  imprinted  IGF2  gene  is  expressed  from  the  paternal 
allele  (26).  The  ICR  on  the  maternal  allele  is  unmethylated  and 
bound  by  CTCF,  a  zinc-finger  protein  that  acts  as  a  boundary 
between  the  enhancers  located  3'  of  H19  and  the  promoters  of 
IGF2.  The  paternal  allele  is  methylated  at  the  ICR,  preventing 
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Figure  1.  Ectopic  Myc  induces  the  expression  of  HI 9  and  represses  IGF2 
expression.  A,  schematic  of  the  H19/IGF2  locus,  where  active  promoters  are 
denoted  with  arrows  and  the  imprinting  control  region  ( gray  circles)  is  shown 
bound  by  CTCF  or  methylated  (CH3).  B,  HI  9  ncRNA  expression  in  MCF10A  and 
T98G  cells  with  (+)  and  without  (-)  ectopic  Myc  as  assessed  by  quantitative 
RT-PCR.  36B4  ribosomal  protein  mRNA  was  used  as  a  normalization  control. 
C,  IGF2  mRNA  levels  in  MCF10A  cells.  Quantitative  RT-PCR  was  conducted  in 
triplicates  twice.  Columns,  mean;  bars,  SD.  *,  P  <  0.05,  statistical  significance 
as  assessed  using  a  paired  t  test. 

CTCF  binding,  allowing  the  enhancers  to  potentiate  IGF2  tran¬ 
scription  (26). 

To  evaluate  whether  c-Myc  regulates  HI  9  expression,  we  intro¬ 
duced  ectopic  c-Myc  into  several  cell  types,  including  MCF10A  im¬ 
mortalized  nontransformed  mammary  epithelial  cells  and  T98G 
glioblastoma  cells  (Supplementary  Fig.  S2A).  Cells  with  ectopic 
c-Myc  expression  showed  7-  to  10-fold  up-regulation  of  H19  ncRNA 
expression  as  assessed  by  real-time  quantitative  RT-PCR  (Fig.  IB), 
semiquantitative  RT-PCR,  and  Northern  blotting  (Supplementary 
Fig.  S2B).  This  up-regulation  was  also  evident  in  diploid  fibro¬ 
blasts  WI38  (data  not  shown)  and  a  medulloblastoma  cell  line 
UW228  (see  Fig.  3 B).  The  activation  of  the  constitutively  expressed 
c-Myc/ estrogen  receptor  regulatory  region  (MycERTAM)  chimera  by 
4-hydroxytamoxifen  also  resulted  in  elevated  H19  expression  in 
Rat-1  fibroblasts  (Supplementary  Fig.  S2C). 

Given  the  strong  up-regulation  of  H19,  we  assessed  the  expres¬ 
sion  of  IGF2  in  response  to  exogenous  c-Myc  expression.  Interest¬ 
ingly,  c-Myc  down-regulated  IGF2  in  several  cell  systems,  such  as 
MCF10A  and  WI38  (Fig.  1C;  data  not  shown).  However,  in  T98G  cells, 
which  have  low  to  undetectable  levels  of  IGF2,  down-regulation 
of  IGF2  was  not  consistently  detectable  (data  not  shown).  This 
prompted  further  evaluation  in  rat  cardiomyocytes  that  have  high 
basal  levels  of  IGF2  transcripts.  c-Myc  robustly  repressed  the  levels 
of  IGF2  in  H9C2  cardiomyocytes  as  shown  by  Northern  blot 
(Supplementary  Fig.  S2D).  Thus,  c-Myc  strongly  up-regulates  H19 
and  down-regulates  IGF2  transcript  levels  in  several  cell  types. 

c-Myc  directly  binds  to  the  regulatory  regions  of  H19  and 
IGF2.  To  determine  whether  c-Myc  directly  regulates  H19,  we 
assessed  in  vivo  genomic  DNA  binding  of  c-Myc  to  the  regulatory 
region  of  H19  using  chromatin  immunoprecipitation,  focusing 
on  the  evolutionary  conserved  E-boxes  situated  1.5  and  3.1  kb 
upstream  from  the  transcription  start  site.  c-Myc  immunoprecipi- 
tates  were  highly  enriched  in  these  DNA  fragments  compared 
with  control  serum  immunoprecipitates  (Fig.  2 A,  primer  sets  1 
and  3).  The  ICR  region  containing  the  second  cluster  of  three 
DNase  hypersensitive  sites  essential  for  CTCF  binding  showed 


weaker  binding  of  c-Myc  (Fig.  2 A,  primer  set  2).  Interestingly, 
the  abovementioned  E-boxes  are  650  and  360  bp  away  from  the 
second  hypersensitive  site  cluster  within  the  ICR,  and  there  are 
also  numerous  noncanonical  E-boxes  interspersed  between  the 
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Figure  2.  Myc  binds  to  the  promoter  regions  of  HI 9  and  IGF2  to  regulate 
gene  transcription.  A,  Myc  binding  at  the  regulatory  region  of  the  HI 9  and  IGF2 
genes,  as  well  as  the  CAD  (positive  control)  and  chromosome  21  E-box 
(negative  control)  was  assessed  by  chromatin  immunoprecipitation  using 
Myc-specific  antibody  (Myc)  or  preimmune  serum  (Control)  in  MCF10A  cells 
expressing  ectopic  Myc.  Primers  are  indicated  by  arrows.  DNase-hypersensitive 
sites  (gray  circles)  comprise  the  ICR.  Relative  binding  was  normalized  to  the 
input  signal  and  expressed  as  0.001%  of  input.  Quantitative  PCR  was  conducted 
in  triplicates  twice.  Columns,  mean;  bars,  SD.  *,  P  <  0.05,  statistical  significance 
as  assessed  using  a  paired  t  test.  B,  the  transcriptional  components  bound 
at  the  H19  promoter  were  assessed  by  chromatin  immunoprecipitation  using 
primers  to  the  proximal  E-box  (equivalent  to  primer  set  3  in  A).  Rat  Myc-null 
cells  that  were  reconstituted  with  Myc  (Myc)  or  control  green  fluorescent 
protein  vector  (vector)  were  assayed  using  indicated  antibodies.  Chromatin 
immunoprecipitation  experiments  were  repeated  at  least  twice  with 
representative  gels  shown. 
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hypersensitive  sites.  Therefore,  from  the  binding  data  alone  it 
remained  unclear  whether  Myc  would  regulate  the  promoter  of 
H19  or  the  ICR  and  imprinting.  This  issue  is  addressed  through 
functional  analysis  below.  We  have  also  investigated  c-Myc  binding 
to  the  IGF2  gene  using  a  scanning  chromatin  immunoprecipitation 
approach.  IGF2  is  expressed  from  the  promoter  P4  in  MCF10A  cells 
(data  not  shown);  however,  c-Myc  binding  was  weak  or  undetect¬ 
able  not  only  at  the  P4  promoter  but  also  at  several  other  sites, 
including  the  differentially  methylated  region  1  and  the  coding 
region  in  the  exon  9  (Fig.  2 A,  primer  set  4;  data  not  shown).  Only 
the  E-box  5'  from  coding  exon  7  showed  binding  of  c-Myc  (Fig.  2 A, 
primer  set  5).  The  positive  control  CAD  promoter  was  highly 
enriched  in  c-Myc  immunoprecipitates,  whereas  the  negative 
control  E-box  at  chromosome  21  showed  no  significant  enrichment 
(Fig.  2 A).  Thus,  c-Myc  binds  to  the  regulatory  regions  of  H19  and 
IGF2  in  vivo. 

c-Myc  recruits  HATs  to  the  HI  9  promoter.  To  investigate 
evolutionary  conservation  and  further  assess  the  mechanism  of 
HI  9  transcriptional  regulation  by  c-Myc,  we  compared  rat 
fibroblast  cells  that  are  devoid  of  Myc  expression  with  those  that 
have  been  reconstituted  with  ectopic  c-Myc  for  promoter 
occupancy  by  chromatin  immunoprecipitation  analysis.  In  the 
absence  of  c-Myc,  Max  is  bound  to  the  H19  promoter  proximal 
E-box,  a  feature  previously  described  as  a  hallmark  of  c-Myc- 
regulated  genes  (Fig.  2 B,  top,  lane  7;  ref.  13).  Reintroducing  c-Myc 
into  these  cells  resulted  in  binding  of  c-Myc,  the  recruitment  of 
RNAPII  (Fig.  2 B,  top,  lanes  1  and  3)  and  an  increase  in  the 
acetylation  of  histone  H3  (AcH3)  and  H4  (AcH4)  at  the  H19  E-box 
(Fig.  2 B,  bottom,  lanes  1  and  2).  The  role  of  histone  acetylation  in 
c-Myc-mediated  transcriptional  induction  of  H19  was  further 
supported  by  evaluating  the  effect  of  the  histone  deacetylase 
(HDAC)  inhibitor  TSA.  Treatment  with  TSA  resulted  in  increased 
expression  of  H19  in  myc cells  as  detected  by  semiquantita- 
tive  RT-PCR  (Fig.  3 A  lanes  1-4).  Curiously,  the  level  of  H19  expres¬ 
sion  achieved  by  TSA  treatment  was  higher  in  myc cells  than  in 
c-Myc-expressing  cells  (Fig.  3 A  lattes  3  and  4).  Thus,  histone 
acetylation  activity  is  recruited  by  c-Myc  to  the  H19  promoter, 
leading  to  its  activation. 

c-Myc  does  not  affect  imprinting  of  the  H19/IGF2  locus. 

Because  the  paternal  allele  of  H19  is  usually  silent  due  to  ICR 
methylation  (see  Fig.  L4),  the  effect  of  DNA  methylation  on  c-Myc 
regulation  of  H19  and  IGF2  was  further  investigated.  HI  9 
expression  was  evaluated  in  myc  cells  and  c-Myc  reconstituted 
cells  in  the  presence  and  absence  of  DNA  methylation  inhibitor 
AzaC.  Exposure  to  AzaC  did  not  result  in  H19  up-regulation  in 
myc cells  (Fig.  3 A,  compare  lanes  1,  5,  and  7),  whereas  blocking 
DNA  methylation  increased  HI 9  expression  in  c-Myc-reconstituted 
cells  (Fig.  3 A,  compare  lane  2  with  lanes  6  and  8).  By  removing 
DNA  methylation  (AzaC)  and  then  increasing  histone  acetylation 
(TSA),  H19  expression  was  potentiated  in  myc cells  (Fig.  3 A, 
compare  lanes  1,  3,  and  9),  and  no  further  induction  by  c-Myc  was 
evident  under  these  conditions  (Fig.  3 A,  lanes  9  and  10).  Blocking 
DNA  methylation  also  did  not  affect  IGF2  basal  gene  expression 
(Fig.  3 A,  compare  lanes  1,  5,  and  7)  but  further  potentiated  c-Myc 
repression  of  this  target  gene  (Fig.  3 A,  compare  lanes  2,  6,  and  <S’). 
c-Myc  repression  of  IGF2  remains  intact  despite  treatment  with 
both  TSA  and  AzaC  (Fig.  34,  lanes  9  and  10).  These  data  suggest 
that  c-Myc  regulation  of  the  H19/IGF2  locus  does  not  involve  DNA 
methylation  or  imprinting.  Moreover,  loss  of  methylation  alone  is 
insufficient  to  alter  basal  gene  expression  but  can  cooperate  with 
c-Myc  to  potentiate  the  regulation  of  these  target  genes. 
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Figure  3.  Myc  binds  and  regulates  only  one  allele  of  HI 9.  A,  semiquantitative 
RT-PCR  of  HI 9  and  IGF2  in  Myc  null  rat  fibroblasts  expressing  the  control 
vector  (-)  or  ectopic  Myc  (+)  that  were  exposed  to  HDAC  inhibitor  TSA 
and/or  DNA  methylation  inhibitor  AzaC  for  the  time  indicated.  The  ribosomal 
protein  36B4  mRNA  serves  as  a  control.  Semiquantitative  RT-PCR  was 
conducted  twice  with  similar  results.  B,  allelic  expression  analysis  of  HI 9 
using  semiquantitative  RT-PCR.  C,  allele-specific  binding  by  Myc  and  CTCF 
in  MCF10A  cells  that  were  used  for  chromatin  immunoprecipitation  with 
Myc-specific  and  CTCF-specific  antibodies,  and  the  immunoprecipitates  were 
analyzed  with  allele-specific  primers.  PCR  analysis  was  conducted  thrice; 
representative  data  are  shown. 


c-Myc  up-regulates  only  one  allele  of  HI  9.  Because  the  H19/ 
IGF2  locus  is  subject  to  genomic  imprinting,  we  had  a  unique 
opportunity  to  investigate  the  transcriptional  regulatory  effect  of 
c-Myc  on  the  allelic  expression  of  H19.  Multiple  cell  lines  were  first 
tested  for  polymorphisms  in  the  H19  locus  (data  not  shown),  and 
the  allele-specific  primers  A  and  B  ( Alul  polymorphism)  were  then 
designed,  differing  by  a  3/  single  nucleotide.  The  primers  were 
tested  on  the  homozygous  MCF7  and  T98G  cells  to  show  allelic 
specificity  (Fig.  3 B,  lanes  8  and  9).  In  MCF10A  cells,  ectopic 
expression  of  c-Myc  induced  expression  of  H19  from  the  single 
allele  B  (Fig.  3 B,  lanes  1  and  2).  c-Myc  induction  of  both  alleles  of 
H19  was  evident  in  the  presence  of  AzaC,  showing  that  in  the 
absence  of  DNA  methylation,  c-Myc  can  access  and  activate  both 
alleles  of  H19  (Fig.  3 B,  lanes  3  and  4).  Similarly,  ectopic  expression 
of  increasing  amounts  of  c-Myc  activates  only  the  single  allele  A 
of  HI 9  in  a  dose-dependent  manner  in  Uw228  medulloblastoma 
cells  (Fig.  3 B,  lanes  5-7).  Therefore,  c-Myc  induces  H19  expression 
by  a  mechanism  that  is  restricted  to  only  one  allele  of  H19. 
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c-Myc  binds  to  the  promoter  of  one  H19  allele.  To  further 
evaluate  the  molecular  mechanism  of  c-Myc  induced  allele-specific 
expression  of  HI 9,  we  queried  whether  c-Myc  binds  to  only  one  or 
both  alleles  of  HI 9  in  vivo.  Allele-specific  chromatin  immunopre- 
cipitation  primers  were  tested  on  control  DNA  from  homozygous 
HCT116  and  D341  cell  lines  to  ensure  that  only  one  allele  is 
recognized  using  this  approach  (Fig.  3 C,  lanes  5  and  6).  Both  c-Myc 
and  CTCF  are  bound  to  allele  B  but  not  allele  A  in  MCF10A  cells 
(Fig.  3 C,  compare  lanes  1-3  and  lanes  2-3).  The  weaker  signal  for 
c-Myc  binding  could  be  due  to  the  allele-specific  primers  being 
situated  0.4  kb  away  from  the  E-box.  The  bound  allele  was 
designated  as  allele  B  (Fig.  35)  because  it  has  been  reported  that 
CTCF  binds  to  the  maternally  derived,  expressed  allele  of  H19  (26). 
Indeed,  allele  B  of  H19  is  bound  by  c-Myc  and  CTCF  (Fig.  3C,  lanes 
1  and  2),  whereas  allele  A  is  not  bound  by  either  c-Myc  or  CTCF, 
nor  it  is  sensitive  to  induced  expression  in  response  to  ectopic 
c-Myc  in  MCF10A  cells.  These  allelic  binding  results  were  also 
confirmed  by  methyl-specific  PCR  (data  not  shown).  Thus,  c-Myc 
and  CTCF  bind  to  the  maternally  derived,  expressed  allele  of  HI  9. 

H19  knockdown  inhibits  tumorigenic  properties  of  breast 
and  lung  cancer  cells.  To  assess  the  role  of  H19  in  transformation, 
we  evaluated  the  expression  and  functional  significance  of  H19  in 
cells  derived  from  breast  and  lung  carcinomas.  H19  has  recently 
been  reported  to  be  elevated  in  a  high  proportion  of  these  tumor 
types  (20,  22).  H19  expression  was  evident  in  the  SKBR3  and  T47D 
breast  cancer  cell  lines  and  the  lung  adenocarcinoma  A549  cells, 
whereas  expression  was  undetectable  in  MDA-MB231  breast  cancer 
cell  line  (Fig.  4 A).  Stable  siRNA-mediated  knockdown  of  H19 
resulted  in  significant  decrease  in  both  clonogenicity,  as  assessed 
by  the  efficacy  of  colony  formation  on  solid  support  (Fig.  45),  and 
anchorage-independent  growth,  as  assessed  by  colony  formation 
in  soft  agar  (Fig.  4 C).  The  inhibition  of  both  clonogenicity-  and 
anchorage-independent  growth  in  response  to  H19  knockdown  was 
significant  in  all  cell  lines  with  H19  expression  (SKBR3,  T47D,  and 
A549),  and  insignificant  in  cells  with  undetectable  basal  H19 


expression,  such  as  MDA-MB231  cells  or  H460  and  H520  lung 
carcinoma  cell  lines  (Fig.  45  and  C;  data  not  shown).  A  lack  of 
biological  effect  of  H19  siRNA  in  MDA-MB231,  H460,  and  H520 
serves  as  an  important  specificity  control.  Three  of  five  siRNAs 
designed  worked  well  to  down-regulate  H19  expression;  however, 
only  one  was  specific  (data  not  shown).  Another  siRNA  we  iden¬ 
tified  as  a  nonspecific  H19  siRNA  was  previously  used  to  transiently 
down-regulate  H19  (35).  Thus,  H19  contributes  to  the  clonogenic 
and  anchorage-independent  growth  properties  in  breast  and  lung 
cancer  cells  with  reactivated  H19  expression. 

H19  regulation  of  IGF2.  As  H19  has  been  reported  to  repress 
the  levels  of  IGF2,  we  also  monitored  the  effect  of  H19  knockdown 
on  IGF2  levels.  No  significant  and  consistent  increase  in  IGF2  ex¬ 
pression  upon  H19  repression  was  noted  in  any  of  the  breast  cancer 
cells  (Fig.  4 D)  or  MCF10A  cells  expressing  ectopic  c-Myc  and  ele¬ 
vated  H19  (data  not  shown).  Only  the  A549  cells  with  H19  knock¬ 
down  displayed  increased  IGF2  transcript  expression  (Fig.  4 D). 

c-Myc  binds  to  the  HI  9  promoter  in  SKBR3  but  not  MDA- 
MB231  cells.  To  further  evaluate  the  mechanism  of  c-Myc 
induction  of  H19  transcription,  we  queried  whether  c-Myc  was 
bound  to  the  H19  E-box  (—1.5  kb)  in  cells  with  (SKBR3)  and 
without  (MDA-MB231)  basal  H19  expression.  Quantitative  chro¬ 
matin  immunoprecipitation  analysis  showed  that  c-Myc,  CTCF, 
RNAPII,  and  AcH4  were  present  at  the  HI  9  E-box  in  SKBR3 
compared  with  control  antibody  (Fig.  5 A).  However,  in  MDA- 
MB231  cells,  binding  of  c-Myc  and  RNAPII  was  not  evident, 
whereas  the  binding  of  CTCF  was  reduced,  and  AcH4  was  unaf¬ 
fected  (Fig.  5 A).  The  positive  control  E-box  (CAD)  bound  c-Myc  and 
RNAPII  in  both  cell  lines,  whereas  the  negative  control  E-box  on 
chromosome  21  had  no  significant  binding  of  any  of  the  factors 
(Fig.  55  and  C).  Therefore,  c-Myc  binds  to  the  promoter  region  of 
H19  in  SKBR3  cells  and  contributes  to  the  up-regulation  of  H19. 

High  expression  levels  of  H19  correlate  with  elevated  levels 
of  c-Myc  mRNA  in  breast  and  lung  cancer  patients.  To  deter¬ 
mine  if  c-Myc  regulation  of  H19  is  a  feature  of  primary  human 
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Figure  4.  HI 9  siRNA  inhibits  cellular 
transformation.  A,  quantitative  RT-PCR 
assessment  of  the  basal  HI 9  expression 
and  efficacy  of  HI 9  siRNA.  36B4  mRNA 
was  used  for  normalization.  The  graph  of 
SKBR3  is  illustrated  separately  due  to  the 
very  high  HI 9  expression.  Quantitative 
RT-PCR  was  conducted  twice  in  triplicate. 
Columns,  mean;  bars,  SD.  *,  P  <  0.05, 
statistical  significance  as  assessed  using  a 
paired  t  test.  B,  clonogenicity  is  decreased 
in  response  to  the  ectopic  expression 
of  HI 9  siRNA  in  cell  lines  with  basal 
HI 9  expression.  The  experiment  was 
conducted  two  independent  times  with 
consistent  results;  a  representative 
experiment  is  shown.  Experimental  results 
were  normalized  between  the  cell  lines. 
Columns,  mean;  bars,  SD.  *,  P  <  0.05, 
statistical  significance  as  assessed  using  a 
paired  t  test.  C,  anchorage-independent 
growth  is  decreased  in  response  to 
HI 9  siRNA.  Data  were  analyzed,  and 
significance  test  was  done  as  in  (B). 

D,  quantitative  RT-PCR  analysis  of  IGF2 
expression  was  conducted  as  in  A. 
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Figure  5.  Transcriptional  complex  binding 
at  the  H19  promoter  in  SKBR3  and 
MDA-MB231  cells.  A,  chromatin 
immunoprecipitation  and  quantitative  PCR 
analysis  of  transcriptional  components 
binding  the  H19  E-box.  Left,  Myc  and 
RNAPII  binding  relative  to  control;  right, 
CTCF  and  AcH4  binding.  B,  chromatin 
immunoprecipitation  and  quantitative  PCR 
analysis  of  Myc  and  RNAPII  binding  at  the 
positive  control  CAD  in  both  cell  lines. 

C,  chromatin  immunoprecipitation  and 
quantitative  PCR  analysis  of  negative 
control  E-box  binding  on  chromosome  21 . 
Real-time  quantitative  PCRs  were  done 
and  analyzed  as  in  Fig.  2.  *,  P  <  0.001, 
statistical  significance  as  assessed  using 
a  f  test. 
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tumors,  we  obtained  expression  estimates  for  both  genes  from  a 
large  microarray  study  of  137  node-negative  breast  cancer  cases. 
Following  data  normalization  and  centering,  patients  were 
dichotomized  into  two  equal-sized  groups  based  on  H19  expres¬ 
sion  levels  (Fig.  6).  Using  a  two-tailed  heteroscedastic  t  test,  we 
show  that  c-Myc  expression  was  significantly  higher  (P  =  0.009) 
in  breast  tumors  with  high  H19  levels  than  those  showing  lower 
H19  expression.  To  show  that  this  is  a  feature  of  other  tumor  types, 
we  characterized  the  relationship  between  H19  and  c-Myc 
expression  in  lung  cancers  using  quantitative  RT-PCR  in  a  panel 
of  240  NSCLCs.  Following  normalization  to  a  battery  of  four 


Figure  6.  The  expression  of  HI 9  and  c-Myc  are  related  in  both  breast  (n  =  137) 
and  lung  (n  =  240)  tumor  samples.  For  each  tumor  type,  tumors  were 
dichotomized  into  two  equal-sized  groups  based  on  HI 9  expression,  and  c-Myc 
expression  was  compared  between  the  high  and  low  HI 9  cohorts.  *,  P  <  0.01, 
significance  by  a  two-tailed  t  test.  Columns,  mean;  bars,  95%  confidence 
intervals. 


housekeeping  genes,  we  again  dichotomized  patients  into  two 
equal-sized  groups  based  on  H19  expression  levels  (Fig.  6).  Using  a 
two-tailed  heteroscedastic  t  test,  we  again  found  that  c-Myc  expres¬ 
sion  was  significantly  higher  (P  =  0.002)  in  tumors  with  elevated  H19 
levels  than  those  showing  lower  H19  expression.  To  ensure  that  our 
results  are  not  artifacts  of  the  statistical  analysis,  we  verified  the 
results  using  a  two-log  threshold  with  the  hypergeometric  test  and 
again  found  statistical  significance  (P  =  0.01).  Both  boxplot  and 
spikeplot  representations  of  the  data  are  available  as  Supplemen¬ 
tary  Figs.  S3  and  S4.  These  analyses  show  positive  correlation  of 
high  H19  levels  with  elevated  c-Myc  mRNA  levels.  Selected  clinical 
information  is  available  in  Supplementary  Tables  S4  and  S5. 

Discussion 

Based  on  allelic  expression  and  genomic  binding  studies,  we 
show  that  c-Myc  directly  binds  to  the  H19  promoter  and  highly 
up-regulates  the  transcription  of  the  maternal  HI  9  allele  by 
recruiting  HAT  activity.  We  further  show  that  H19  knockdown,  in 
a  panel  of  breast  and  lung  cancer  cell  lines,  results  in  the 
reduction  of  their  tumorigenic  phenotype  as  shown  by  foci 
formation  and  anchorage-independent  growth  assays.  Indeed,  a 
strong  association  between  c-Myc  and  H19  transcript  levels  was 
evident  in  both  primary  breast  and  lung  cancer  patient  material. 
Taken  together,  these  results  indicate  that  Myc  up-regulation  of 
H19  strongly  contributes  to  the  tumorigenic  phenotype  of  breast 
and  lung  cancer  cells. 

Our  study  shows  that  c-Myc  binds  to  the  conserved  E-boxes  at 
the  HI 9  promoter  close  to  the  ICR  and  up-regulates  the  expression 
of  this  ncRNA  in  MCF10A  breast  epithelial  and  established  rat 
fibroblast  cell  lines.  The  physical  boundary  between  the  ICR  and 
the  promoter  of  H19  remains  undefined;  thus,  we  investigated  the 
mechanism  of  regulation  and  show  that  Myc  does  not  affect  ICR 
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function  or  imprinting  of  this  locus,  as  H19  remains  expressed  from 
one  allele  only.  In  a  parallel  study,  Lee  et  al.  also  found  that  c-Myc 
up-regulates  H19  in  mouse  liver  in  vivo.s  However,  their  data 
indicate  the  strongest  binding  of  Myc  at  an  enhancer  downstream 
of  the  H19  promoter  in  hepatocytes.  We  did  not  see  significant 
binding  of  Myc  to  this  region  in  MCF10A  cells  expressing  ectopic 
Myc  (data  not  shown).  The  distinct  binding  sites  and  possibly 
induction  mechanisms  used  by  Myc  in  human  and  rat  transformed 
cells  compared  with  murine  nontransformed  hepatocytes  is 
intriguing  and  likely  is  a  consequence  of  cell  type  differences. 
Importantly,  only  the  maternal  allele  of  H19  was  regulated  by  Myc 
in  both  studies. 

Our  results  show  that  c-Myc  binds  and  regulates  the  maternal 
HI  9  allele  strongly  supporting  the  observation  that  the  c-Myc/Max 
complex  is  unable  to  bind  methylated  E-boxes  in  vitro  (36). 
While  this  article  was  in  preparation,  N-Myc  was  also  reported  to 
selectively  bind  to  unmethylated  E-boxes  (37).  This  suggests  that 
E-box  methylation  is  a  determinant  of  c-Myc  and  N-Myc  as 
regulators  of  gene  transcription.  This  is  further  supported  by  our 
results  showing  that  the  silenced  paternal  allele  did  not  display 
binding  or  regulation  by  Myc  in  MCF10A  cells,  whereas  removal  of 
DNA  methylation  leads  to  induction  of  the  paternal  allele  by  c-Myc. 
These  data  suggest  that  changes  in  the  DNA  methylation  patterns, 
which  are  common  in  cancer  (38),  could  alter  transcriptional 
profiles  and  thus  modulate  the  biological  function  of  Myc. 

Our  results  indicating  that  c-Myc  binds  and  activates  HI  9  in  an 
allele-specific  manner  suggests  that  additional  transcription  factors, 
as  well  as  Myc,  may  regulate  transcription  with  allelic  preference. 
Recent  analyses  have  shown  that  allelic  variation  in  gene  expression 
is  common  even  in  the  nonimprinted  genes  in  mammalian  genomes 
and  may  affect  20%  to  50%  of  human  genes  (39).  We  also  show  that 
the  main  function  of  Myc  at  the  H19  promoter  is  to  recruit  FLAT 
activity  and  RNAPII  for  transcriptional  initiation.  In  cells  lacking 
Myc,  basal  H19  expression  was  induced  only  in  response  to  HDAC 
inhibition  but  not  to  the  inhibition  of  DNA  methylation.  Similarly, 
the  silencing  of  pl6INKIA  can  also  occur  in  the  absence  of 
DNA  methylation  through  histone  modifications  (40).  Therefore, 
Myc  binds  the  ICR  of  HI 9  in  an  allele-specific  manner  to  induce 
transcription  through  the  recruitment  of  HAT  activity. 

Furthermore  c-Myc  down-regulates  IGF2  transcripts  in  cell  lines 
with  detectable  IGF2  mRNA.  Mechanistically,  our  data  indicate 
that  Myc  binding  is  restricted  to  the  maternal  allele  ICR  and  is 
unlikely  to  have  an  effect  on  IGF2  expression  from  the  paternal 
allele.  Although  it  has  been  reported  that  H19  can  repress  IGF2 
expression  (41),  we  did  not  consistently  observe  this  cross¬ 
regulation  in  the  cell  systems  studied,  suggesting  this  potential 
regulatory  mechanism  may  be  a  cell-dependent  and/or  tumor 
type-dependent  phenomenon.  Surprisingly,  our  data  indicate  that 
Myc  binds  the  E-box  in  the  first  intron  of  IGF2  to  repress 
transcription.  Most  Myc  repressed  genes  analyzed  to  date  are 
regulated  through  initiator  or  proximal  promoter  regions  (6-8). 
It  remains  to  be  determined  whether  E-box-dependent  Myc 
repression  is  evident  and  unique  to  IGF2  or  functional  at  multiple 
loci.  Moreover,  it  remains  unclear  whether  this  potentially  novel 
mechanism  is  best  captured  through  genomic  analysis  in  vivo  and 
not  detectable  by  conventional  transient  indicator  gene  analysis 
methods.  Biologically,  gene  expression  analysis  of  a  series  of 
neuroblastoma  cell  lines  indicated  that  high  N-Myc  levels 
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correlate  with  low  IGF2  levels  (42).  However,  colon  carcinomas 
display  high  levels  of  Myc  and  loss  of  IGF2  imprinting  (expressed 
from  both  alleles)  that  is  sometimes  accompanied  by  elevated 
levels  of  IGF2  (43).  This  implies  that  the  repression  of  IGF2  is  cell 
specific  or  is  lost  in  colon  cancer  cells.  At  this  time,  the 
significance  of  IGF2  down-regulation  by  Myc  remains  unclear.  As 
a  mitogen  in  many  cell  types,  IGF2  is  part  of  a  signaling  cascade 
that  is  able  to  induce  Myc  and  Myc  repression  of  IGF2  could 
function  as  part  of  negative  feedback  loop  to  control  mitogen 
stimulation.  A  similar  function  has  been  proposed  for  Myc 
repression  of  the  platelet-derived  growth  factor-(3  receptor  (7). 

The  physiologic  role  of  H19  ncRNA  seems  to  be  restricted  to  the 
time  of  embryonic  expression  (44),  and  its  pathologic  role  has  only 
recently  been  investigated.  The  introduction  of  H19  into  some  cell 
lines  resulted  in  anchorage-independent  growth  suppression,  thus 
earning  H19  tumor  suppressor  designation  (18,  19).  Recently, 
however,  H19  expression  has  been  shown  to  be  reactivated  in  a 
variety  of  tumors  (20-25).  Ectopic  expression  of  H19  in  MDA- 
MB231  and  T24P  cells  increased  their  growth  and  tumorigenicity 
(45,  46),  whereas  the  subsequent  knockdown  of  H19  in  MDA- 
MB231  reduced  cell  proliferation  (35).  We  show  that  the  knock¬ 
down  of  H19  in  SKBR3,  T47D,  and  A549  cells  leads  to  the  reduction 
of  their  clonogenic  ability  and  decreased  anchorage-independent 
growth.  The  analysis  of  H19  and  c-Myc  expression  in  primary 
breast  and  lung  tumor  samples  indicated  that  high  H19  levels  are 
strongly  associated  with  high  Myc  transcript  levels.  This  strong 
association  in  primary  tissue,  in  combination  with  the  essential 
role  of  H19  in  transformation,  suggests  that  Myc-induced  H19 
expression  contributes  to  both  tumor  etiology  and  Myc’s  function 
as  an  oncogene. 

The  function  of  H19  remains  enigmatic.  Although  knockout  H19 
mice  seem  grossly  normal  (47),  we  and  others  show  that  H19  plays 
a  role  in  the  tumorigenic  phenotype  (35,  45,  46).  Based  on  the 
physical  association  of  the  H19  transcript  with  polysomes,  the 
mechanism  of  H19  action  is  thought  to  be  at  the  level  of  trans¬ 
lational  regulation  (48).  In  addition,  thioredoxin,  a  modulator  of 
signal  transduction  and  potentiator  of  tumorigenesis,  was  re¬ 
cently  identified  as  translationally  up-regulated  by  H19  (49).  Fur¬ 
ther  studies  will  undoubtedly  elucidate  other  H19-regulated 
molecules  and  the  role  of  H19  in  both  physiologic  and  patho¬ 
logic  settings.  As  more  transcriptional  regulators  of  H19  are 
determined,  H19  regulation  and  function  will  also  be  better 
understood.  Although  this  work  was  in  progress,  E2F1  was  shown 
to  regulate  H19  (35),  which  suggests  that  Myc  and  E2F1  can  exert  a 
positive  combinatorial  control  over  H19  transcription.  It  is  not¬ 
able  that  genes  bound  by  Myc  (12-15)  and  E2F1  (50)  display  a 
considerable  overlap. 

We  have  thus  identified  HI  9  as  a  Myc-up-regulated  gene  that 
potentiates  the  tumorigenic  phenotype  of  breast  and  lung  can¬ 
cer  cells.  Complex  interactions  between  DNA  methylation  and  his¬ 
tone  modifications  regulate  H19  expression,  where  the  role  of  Myc 
is  to  recruit  HAT  activity  to  unmethylated  E-boxes  and  initiate 
allele-specific  H19  transcription. 
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Trial  watch 


MANY  USES  FOR  LAPATINIB 


At  the  recent  American  Society  of  Clinical  Oncology  annual  conference, 
positive  clinical  trial  results  with  lapatinib  in  various  cancer  types  were 
presented.  Lapatinib  is  a  dual  kinase  inhibitor  that  targets  the  epidermal- 
growth-factor  receptors  1  (ECFR  or  ERBB1)  and  2  (HER2  or  ERBB2). 

A  phase  III  open-label  study  reported  by  Charles  Geyer  was  designed  to 
evaluate  time  to  progression  (TTP)  in  women  with  ERBB2-positive 
trastuzumab-resistant  advanced  breast  cancer  treated  with  lapatinib  plus 
capecitabine,  or  capecitabine  alone.  Of  321  women  evaluated,  the  median  TTP 
in  the  combination  group  was  36.9  weeks,  compared  with  19.7  weeks  in  the 
monotherapy  group.  Median  progression-free  survival  was  36.9  weeks  on 
combination  therapy  versus  17.9  weeks  on  capecitabine  alone.  Survival  data 
are  not  yet  mature,  but  are  similar  so  far.  The  superior  efficacy  of  the 
combination  therapy  has  led  the  Independent  Data  Monitoring  Committee  to 
recommend  discontinuing  enrolment  into  the  trial. 

In  addition,  Neil  Spector  reported  the  results  of  an  ongoing  phase  II 
study  of  lapatinib  monotherapy  for  women  with  relapsed  or  refractory 
inflammatory  breast  cancer.  62%  of  ERBB2  overexpressors  had  a  complete 
or  partial  response;  there  were  no  responders  in  the  group  who 
overexpressed  ERBB1  but  not  ERBB2.  These  data  are  based  on  only  57 
patients,  but  they  indicate  that  ERBB1  expression  alone  cannot  predict 
sensitivity  to  lapatinib. 

Another  phase  III  trial  was  conducted  in  patients  with  advanced  renal¬ 
cell  carcinoma  (RCC)  —  a  tumour  known  to  produce  high  levels  of  ERBB1. 

In  this  study,  417  patients  with  immunotherapy-refractory  RCC  were 
randomized  to  receive  standard  second-line  hormonal  therapy  or  lapatinib. 
Alain  Ravaud  reported  that  while  overall  survival  and  TTP  were  similar, 
those  241  patients  who  greatly  overexpressed  ERBB1  had  longer  TTP  if  on 
lapatinib  rather  than  hormonal  therapy  (15.1  weeks  versus  10.9  weeks),  as 
well  as  overall  survival  (46  weeks  versus  37.9  weeks).  Follow-up  in  this  trial 
is  continuing. 
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FOCUSING  ON  AFRICAN-AMERICAN  WOMEN 


African-American  women  with  breast  cancer  are  known  to  have  a  poorer 
prognosis  than  non-African-American  women.  A  case-control  study  reported 
in  JAMA  is  unique  in  that  it  oversampled  African-American  and  pre¬ 
menopausal  women  to  enable  better  evaluation  of  these  populations  in  terms 
of  tumour  subtype  and  survival  than  has  previously  been  possible. 

496  incident  cases  of  invasive  breast  cancer  were  included  and  subtyped 
into  luminal,  basal-like,  ERBB2  positive  and  oestrogen-receptor  negative  ( E R  ) , 
and  unclassified.  Basal-like  breast  cancer  was  the  most  prevalent  subtype 
among  pre-menopausal  African-American  women  (39%)  compared  with 
postmenopausal  African-American  women  (14%)  or  non-African-American 
women  of  any  age  (16%).  Conversely,  the  luminal  subtype  was  less  prevalent. 
There  was  no  difference  within  the  ERBB2+ER~  subtype  with  regards  to  race  or 
menopausal  status.  The  overall  disease-specific  survival  was  80%,  but  the 
shortest  survival  was  among  the  ERBB2+ER~  (52%)  and  basal-like  subtypes 
(75%).  These  findings  could  contribute  to  the  poor  prognosis  of  young  African- 
American  women  with  breast  cancer,  but  additional  studies  of  long-term 
survival  in  other  African-American  populations  are  required  to  confirm  this. 
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3  TUMORIGENESIS 

The  silent  messenger 

Various  genes  are  known  to  be  regulated  by  the  transcription 
factor  MYC,  but  the  genes  that  are  crucial  for  its  tumorigenic 
qualities  are  not  clearly  defined.  Linda  Penn  and  colleagues 
have  found  that  a  non-coding  RNA,  H19,  is  a  transcriptional 
target  of  MYC,  and  the  inhibition  of  H19  transcription  in 
cancer  cells  suppresses  certain  cellular  characteristics  that  are 
associated  with  oncogenic  transformation. 

Despite  being  first  identified  as  a  tumour  suppressor, 
increased  expression  of  H19  is  seen  in  various  human 
cancers,  including  lung  and  breast  cancer.  H19  resides  at  the 
same  locus  as  the  insulin-like  growth  factor  2  gene  (IGF2), 
and  both  of  these  genes  are  subject  to  genomic  imprinting 
—  only  the  maternal  allele  of  H19  and  the  paternal  allele  of 
IGF2  are  expressed.  Through  the  combined  use  of  reverse 
transcriptase  (RT)-PCR,  northern  blotting  and  allele-specific 
chromatin  immunoprecipitation  techniques  the  authors 
were  able  to  show  that  MYC  binds  to  the  regulatory  regions 
of  the  H19  and  IGF2  genes.  Furthermore,  MYC  only  binds  to 
the  non-methylated  promoter  of  the  maternal  allele  of  H19 
to  induce  transcription,  which  indicates  that  MYC  does  not 
alter  the  imprinting  of  this  gene  and  that  MYC  target  genes 
are  determined  by  promoter  methylation. 

Small  interfering  RNAs  that  target  H19  in  cells  derived 
from  lung  and  breast  tumours  inhibited  anchorage- 
independent  growth  and  colony  formation  in  vitro  —  two 
well-known  characteristics  of  transformed  cells.  But  is  there 
a  correlation  between  the  level  of  MYC  expression  and  H19 
expression  in  these  tumours?  The  authors  undertook  a  large 
microarray  study  using  137  node-negative  breast  cancer 
samples.  Statistical  analyses  showed  that  expression  levels  of 
MYC  were  significantly  higher  in  samples  with  high  levels  of 
H19  expression.  Similar  results  were  obtained  using  240  non- 
small-cell  lung  cancer  samples,  which  were  analysed  using 
real-time  RT-PCR. 

The  function  of  non-coding  RNAs  is  unclear,  but  it  is 
thought  that  the  H19  transcript  might  regulate  RNA 
translation.  So,  MYC  might  indirectly  regulate  the  expression 
of  a  number  of  proteins  in  tumour  cells  through  the  increased 
expression  of  H19. 

Nicola  McCarthy 
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Abstract 

The  potent  Myc  oncoprotein  plays  a  pivotal  role  as  a  regulator  of  tumorigenesis  in  numerous  human  cancers  of  diverse  origin. 
Experimental  evidence  shows  that  inhibiting  Myc  significantly  halts  tumour  cell  growth  and  proliferation.  This  review  summarises 
recent  progress  in  understanding  the  function  of  Myc  as  a  transcription  factor,  with  emphasis  on  key  protein  interactions  and  target 
gene  regulation.  In  addition,  major  advances  in  drug  development  aimed  at  eliminating  Myc  are  described,  including  antisense  and 
triple  helix  forming  oligonucleotides,  porphyrins  and  siRNA.  Future  anti-Myc  strategies  are  also  discussed  that  inhibit  Myc  at  the 
level  of  expression  and/or  function.  Targeting  the  dark  side  of  Myc  with  novel  therapeutic  agents  promises  to  have  a  profound 
impact  in  combating  cancer. 

©  2005  Elsevier  Ltd.  All  rights  reserved. 
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1.  Introduction 

These  are  exciting  times  for  clinicians  and  scientists 
working  at  the  front  line  of  cancer  research.  In  recent 
years,  our  understanding  of  the  genetic  etiology  and 
molecular  dependence  of  cancer  has  advanced  to  the 
point  where  we  can  begin  to  exploit  this  knowledge  for 
the  design  of  novel,  effective,  anti-cancer  therapeutics. 
Several  new  classes  of  anti-cancer  agents  have  been 
developed  to  target  pathways  that  are  essential  for  can¬ 
cer  cell  growth  and  survival.  To  date,  the  effort  has  been 
primarily  focused  at  targeting  growth  factor  receptors 
which  has  led  to  achievements,  such  as  Gleevec  (Imati- 
nib,  STI-571),  a  small  molecular  weight  kinase  inhibitor, 
and  Trastuzumab  (Herceptin),  an  antibody  reagent  that 
targets  the  HER-2/neu  receptor  [1-4],  These  agents 
highlight  the  merits  of  molecular  targeting  and  provide 
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an  important  proof-of-concept  that  targeted  therapeu¬ 
tics  can  be  successfully  developed  and  applied  to  patient 
care.  However,  activating  mutations  can  occur  at  multi¬ 
ple  independent  points  along  oncogenic  signalling  cas¬ 
cades,  so  targeting  cell  surface  receptors  restricts  the 
range  of  potentially  sensitive  tumours.  Downstream 
transcription  factors  that  directly  control  the  transfor¬ 
mation  program  provide  an  alternative  target  that  may 
be  activated  in  a  broader  spectrum  of  cancers.  The 
Myc  transcription  factor  is  one  of  the  most  potent  and 
frequently  deregulated  oncoproteins  in  human  cancers 
[5,6].  Multiple  extracellular  and  intracellular  signalling 
cascades  converge  to  regulate  the  Myc  oncogene 
(Fig.  1)  making  it  an  especially  attractive  target  in  the 
control  of  transformation. 

In  this  review,  we  focus  on  the  most  recent  advances 
in  understanding  the  molecular  mechanisms  of  Myc 
function  in  the  etiology  of  human  cancers.  Special 
emphasis  is  placed  on  Myc  interactions  with  other  pro¬ 
teins  and  its  gene-regulatory  mechanisms.  As  well,  we 
provide  an  overview  of  the  past,  present  and  potential 
future  strategies  used  to  target  Myc. 
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Fig.  1.  The  regulatory  network  of  Myc.  Myc  deregulation  is  a 
hallmark  of  many  cancers  and  occurs  as  a  consequence  of  activation  of 
one  or  more  signalling  pathways  that  induce  Myc  expression  and 
function  as  a  regulator  of  gene  transcription.  These  include  mitogen 
activated  protein  kinases  (MAPK)  [192-194],  phosphatidylinositol-3 
kinase  (PI3K)  [195,196],  wnt-TCF/LEF  pathway  [197],  and  signal 
transducer  and  activator  of  transcription  (STAT)  [198]  pathways.  The 
target  genes  regulated  by  Myc  orchestrate  the  many  biological 
activities  attributed  to  Myc,  including  apoptosis,  genomic  instability, 
growth,  transformation,  proliferation,  angiogenesis  and  blocking 
differentiation. 


1.1.  Myc  expression  and  biological  activities 

Members  of  the  myc  family  that  show  oncogenic 
activity  in  human  cancers  include  c-myc,  MYCN  and 
MYCL1.  In  a  normal  physiological  state,  the  protein 
products  of  all  three  genes  are  expressed  during  fetal 
development,  whereas  only  c-Myc  protein  is  expressed  in 
adult  tissues.  c-Myc  protein  levels,  in  non-transformed 
cells,  are  induced  or  suppressed  by  virtually  all  signalling 
cascades  bearing  proliferative  and  anti-proliferative 
cues,  respectively.  Mitogen  stimulation  induces  c-Myc 
as  an  immediate-early  response  gene,  whose  expression 
is  essential  and  sufficient  for  Gl/S  progression  [7-9]. 
c-Myc  also  plays  a  role  in  G2/M  transition,  making  it 
one  of  the  key  players  in  cell  cycle  regulation  [10].  As 
such,  it  is  important  that  c-myc  mRNA  and  protein  have 
a  very  short  half-life  (20-30  min)  and  are  tightly  regu¬ 
lated.  c-Myc  expression  is  normally  rapidly  responsive 
to  environmental  cues  and  has  been  dubbed,  “the  intra¬ 
cellular  sentinel  of  the  extracellular  milieu”  [11].  By  con¬ 
trast  to  the  highly  regulated  state  of  c-Myc  and  the 
absence  of  N-myc  and  L-myc  expression  in  normal  cells, 
cancer  cells  often  harbour  deregulated  expression  of  any 
one  of  these  three  myc  oncogenes  [11]. 

The  founding  member  of  the  family,  c-myc,  was  first 
shown  to  be  an  oncogene  when  it  was  identified  as  the 
transduced  v-myc  gene  of  the  transforming  avian  myelo- 
cytomatosis  retrovirus  [12],  Deregulated  c-Myc  expres¬ 
sion  was  subsequently  shown  to  be  prolific  in  human 


cancers.  In  100%  of  Burkitt’s  lymphoma,  c-Myc  is  trans¬ 
located  with  an  immunoglobulin  enhancer  that  drives 
high  levels  of  constitutive  c-myc  mRNA  and  protein 
expression,  which  is  instrumental  in  initiating  the  disease 
[13],  Translocations  involving  the  c-Myc  locus  have  also 
been  reported  in  several  additional  tumours,  including 
diffuse  large  cell  lymphoma,  T-cell  acute  lymphocytic 
leukaemia  and  multiple  myeloma  [14],  Amplification  of 
c-Myc  and/or  deregulated  expression  is  evident  in  many 
tumours  including  melanomas  and  carcinomas  of  the 
breast,  prostate  and  colon  [5,6,14].  Amplification  of 
MYCN,  is  a  hallmark  of  neuroblastoma  [15,16],  while 
MYCL1  is  amplified  in  ovarian  cancer  [17].  All  three 
transforming  members  of  the  myc  family  can  be  ampli¬ 
fied  in  non-small  cell  lung  carcinoma  [14],  Importantly, 
in  recent  years  it  has  become  clear  that  deregulation  is 
not  restricted  to  gross  genetic  abnormalities  of  the  myc 
gene  family,  such  as  translocation  or  amplification,  but 
can  also  occur  as  a  consequence  of  direct  or  indirect 
mutations  of  regulatory  molecules  controlling  myc  gene 
expression  [11].  Thus,  deregulation  of  Myc  expression  is 
evident  in  numerous  human  cancers  of  diverse  origin 
and  can  result  from  mutations  at  one  or  multiple  levels 
of  regulation.  Unless  otherwise  stated,  Myc  will  refer  to 
data  described  for  c-Myc,  but  these  results  are  often  rel¬ 
evant  for  the  highly  similar  N-Myc  and  L-Myc 
oncoproteins. 

In  response  to  signals  from  the  cellular  environment, 
Myc  can  regulate  a  broad  variety  of  distinct  biological 
activities  (Fig.  1).  In  addition  to  Myc  driving  cell  prolif¬ 
eration,  growth,  and  transformation,  deregulated  Myc 
has  also  been  shown  to  increase  apoptosis,  genomic 
instability,  and  angiogenesis  as  well  as  block  differentia¬ 
tion  [18-28].  The  prevailing  model  is  that  Myc  controls 
such  a  disparate  set  of  activities  by  regulating  distinct 
cohorts  of  target  genes  that  then  orchestrate  each  activ¬ 
ity.  One  of  the  major  gaps  in  the  field  is  linking  Myc 
function,  as  a  transcriptional  regulator,  to  the  wide- 
range  of  biological  activities  controlled  by  Myc  [29],  In 
the  following  sections  we  provide  an  overview  of  the 
enormous  effort  from  several  labs  to  identify  the  target 
genes  regulated  by  Myc  and  to  uncover  the  protein-pro¬ 
tein  interactions  that  are  essential  for  Myc  to  function  as 
a  transcription  factor  in  transformation. 

1.2.  Myc  structure 

Traditionally  the  Myc  protein  has  been  divided  into 
an  N-terminal  domain  (NTD)  involved  in  transactiva¬ 
tion  and  transrepression;  and  a  C-terminal  domain 
(CTD)  that  is  critical  for  DNA  binding.  Myc  is  a  tran¬ 
scription  factor  of  the  basic  helix-loop-helix  leucine  zip¬ 
per  (b-HLH-LZ)  superfamily  [30].  The  C-terminus 
harbours  both  the  primary  nuclear  localisation  signal 
and  the  basic  motif  required  for  binding  to  the  CAC- 
GTG  E-box  DNA  recognition  sequence  (Fig.  2) 
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[31.32] .  The  HLH-LZ  domain,  essential  for  all  known 
Myc  activities,  is  the  heterodimerisation  domain  that  is 
required  for  Myc  to  bind  to  its  primary  partner  protein. 
Max  (Myc  associated  protein  X). 

The  N-terminus  is  a  major  regulatory  region  respon¬ 
sible  for  assembly  of  the  transcriptional  machinery  [11]. 
Within  the  N-terminus  there  are  several  highly  con¬ 
served  sequences  termed  Myc  boxes  which,  together 
with  the  C-terminal  b-HLH-LZ,  define  the  Myc  family 
of  proteins  (Fig.  2).  Although  Myc  box  I  (MBI)  is  re¬ 
quired  for  gene  activation,  deletion  of  this  region  only 
partially  abolishes  the  transforming  ability  of  Myc 

[11.33] .  Myc  box  II  (MBII)  is  essential  for  the  ability 
of  Myc  to  transform,  drive  cell  proliferation,  inhibit  dif¬ 
ferentiation,  repress  gene  transcription,  and  activate  cer¬ 
tain  target  genes  [11].  Recently,  a  third  conserved  region 
of  Myc  has  been  described,  Myc  Box  III  (MBIII),  that 
plays  a  role  in  transformation,  lymphomagenesis  and 
apoptosis  [34],  Interestingly,  recent  results  using  circular 
dichroism  indicated  that  the  N -terminal  domain  shows 
little  to  no  inherent  secondary  structure,  suggesting  that 
protein-protein  interactions  are  essential  for  proper 
folding  and  function  of  Myc  [35,36].  From  a  therapeutic 
perspective  this  observation  offers  hope  that  specific 
inhibitors  can  disrupt  unique  points  of  interaction  be¬ 
tween  Myc  and  its  binding  partners  thereby  inhibiting 
transformation.  This  may  offer  a  novel  approach  to  tar¬ 
geting  oncogenic  Myc  in  a  tumour-specific  manner. 
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Fig.  2.  The  structural  domains  of  human  c-Myc  and  their  link  to 
protein-protein  interactions.  c-Myc  contains  at  least  six  regions  which 
are  highly  conserved  between  Myc  paralogs  and  orthologs.  The  Myc 
N-terminal  domain  (NTD)  is  defined  as  amino  acids  1-262  and 
contains  Myc  Box  I  (MBI),  Myc  Box  II  (MBII)  and  Myc  Box  III 
(MBIII)  and  the  acidic  region  (AR).  The  Myc  C-terminal  domain 
(CTD)  is  defined  as  residues  263^t39  and  contains  the  primary  nuclear 
localisation  signal  (NLS)  and  the  basic  helix-loop-helix  leucine  zipper 
domains  (B-HLH-LZ).  The  regions  of  c-Myc  necessary  for  the 
interaction  of  the  specific  proteins  are  shown. 


2.  Myc  interacting  proteins 

For  Myc  to  hold  its  extensive  role  in  the  control  of 
cellular  function,  a  network  of  key  binding  proteins  is 
required  (Fig.  2).  The  identification  of  this  network 
started  with  the  discovery  of  Max  over  a  decade  ago 
by  screening  a  human  cDNA  library  with  a  radiolabeled 
fusion  protein  containing  the  Myc  CTD  [37],  In  recent 
years,  several  additional  Myc-binding  proteins  have  pri¬ 
marily  been  identified  using  biochemical  and  two-hybrid 
methodologies.  The  Myc-Max  interaction  is  already 
being  explored  as  a  therapeutic  target,  while  inhibitors 
to  other  key  interacting  proteins  more  recently  identi¬ 
fied,  such  as  TRRAP,  have  yet  to  be  investigated.  A 
new  array  of  interactors  has  been  identified  in  recent 
years,  but  their  role  in  Myc  dependent  transformation 
and  the  effect  of  their  disruption  needs  to  be  further 
evaluated  before  they  are  justifiable  targets  for  therapeu¬ 
tic  intervention. 


2.1.  Myc  and  Max:  from  structure  to  biology 

Myc  activity  is  entirely  dependent  upon  dimerisa- 
tion  with  Max,  an  abundant,  ubiquitously  expressed 
b-HLH-LZ  protein.  Recently,  the  co-crystal  structure 
has  been  solved  and  shows  that  homodimers  of  Max 
are  held  together  by  polar  interactions  alone,  while 
Myc-Max  heterodimers  are  stabilised  by  charged 
interactions  [38],  This  results  in  Myc-Max  dimers 
forming  more  easily  than  Max-Max  dimers.  The  crys¬ 
tal  structure  shows  that  Myc-Max  dimers  are  stabi¬ 
lised  by  hydrophobic  and  polar/charged  interactions 
via  helices  in  the  LZ  region.  Two  positively  charged 
residues  in  Myc  form  a  tetrad  with  Max,  and  these 
two  pairs  of  hydrogen  bonds  alone  control  heterodi- 
mer  specificity  with  Max.  This  difference  between 
charged  and  polar  residues  also  explains  the  disfavour 
of  Myc-Myc  homodimers,  caused  by  electrostatic 
repulsions  between  the  complementary  residues 
[38,39].  Perhaps  one  of  the  most  interesting  findings 
from  the  crystallographic  data  is  the  formation  of  a 
tetramer  with  two  Myc-Max  heterodimers  oriented 
head  to  tail  of  the  LZ,  each  binding  a  DNA  E-box. 
It  has  been  proposed  that  biologically  they  may  bind 
widely  separated  E-box  sequences;  however,  the  crys¬ 
tal  structure  reveals  no  specific  hydrogen  bond  interac¬ 
tions  to  stabilise  this  tetramer  [38],  Another  function 
of  the  heterotetramer  might  be  the  formation  of  a 
platform  for  assembly  of  additional  protein  factors 
such  as  Miz-1,  IN  II  and  BRCA1  which  bind  to  the 
b-HLH-LZ  region  of  Myc  (Fig.  2).  The  detailed  struc¬ 
ture  of  Myc-Max  may  give  rise  to  novel  therapeutic 
strategies  that  interfere  with  either  the  formation  of 
the  tetramer  or  the  association  with  Myc  CTD 
interactors. 


2488 


R.  Ponzielli  et  al.  I  European  Journal  of  Cancer  41  ( 2005 )  2485-2501 


2.2.  Myc  interaction  with  chromatin  remodeling  proteins 

In  addition  to  the  well-documented  association  with 
Max,  Myc  has  been  shown  to  interact  with  a  number 
of  additional  transcription  factors  and  co-factors  that 
modulate  its  activity. 

The  DNA  bound  Myc-Max  heterodimer  interacts 
through  the  Myc  N-terminal  region  with  a  variety  of 
proteins  involved  in  transcription.  TRRAP  is  of  partic¬ 
ular  interest  as  it  forms  part  of  a  multiprotein  complex 
with  histone  acetyl-transferase  (HAT)  activity.  Myc  con¬ 
tributes  to  chromatin  remodeling  through  an  MBII- 
dependent  interaction  with  TRRAP  [40,41],  Inhibition 
of  TRRAP  synthesis  or  function  blocks  Myc-mediated 
oncogenesis,  establishing  an  essential  role  for  TRRAP 
in  Myc  activity  [42].  The  yeast  homologue  of  TRRAP, 
tral  is  a  component  of  the  SAGA  (SPT/ADA/GCN5/ 
Acetyltransferase)  complex  [43]  which  also  contains 
the  HAT  GCN5.  The  human  homologue,  hGCN5,  has 
been  shown  to  interact  in  a  complex  with  Myc  through 
TRRAP  [44].  A  Myc-Gcn5  fusion  protein  can  partially 
restore  the  ability  of  a  Myc  MBII  deletion  mutant  to  in¬ 
duce  transformation,  suggesting  that  recruitment  of 
HAT  activity  is  an  important  component  of  Myc  func¬ 
tion.  Myc  also  interacts  with  the  p400  complex  contain¬ 
ing  TRRAP,  surprisingly  this  complex  lacks  HAT 
activity  suggesting  that  Myc-TRRAP  interaction  may 
serve  additional  roles  in  addition  to  recruiting  HATs. 
Nevertheless,  p400  function  in  Myc-mediated  oncogene¬ 
sis  remains  unclear  [45].  The  recent  discovery  that  CBP 
(CREB  binding  protein)  binds  to  the  Myc  CTD  pro¬ 
vides  an  additional  link  between  Myc  CTD  and  activa¬ 
tion  of  transcription.  CBP  functions  partly  through  its 
HAT  activity  and  was  shown  to  interact  with  Myc 
in  vivo  and  to  stimulate  Myc  dependent  transactivation. 
Interestingly,  CBP  also  acetylates  Myc  in  vitro  and  co¬ 
expression  of  the  two  proteins  resulted  in  stabilisation 
of  Myc  [46],  Functionally,  the  biological  significance 
of  this  interaction,  in  particular  for  Myc-mediated  trans¬ 
formation,  is  yet  to  be  determined. 

Myc  interacts  with  several  other  proteins  implicated 
in  chromatin  remodeling,  namely  TIP48  and  TIP49 

[47] .  Interaction  of  these  molecules  with  Myc  requires 
the  Myc  NTD  and  occurs  independently  of  TRRAP 
binding.  T1P48  and  T1P49  have  ATP  hydrolysing  activ¬ 
ity,  as  well  as  suspected  helicase  activity  and  have  been 
shown  to  be  required  for  the  foci  formation  by  Myc 
and  Ras  in  a  primary  co-transformation  assay  [47], 

Myc  may  also  be  involved  in  a  second  mechanism  of 
chromatin  remodeling:  an  ATP-dependent  process 
involving  the  recruitment  of  the  SWI/SNF  complex, 
which  regulates  transcription  through  nucleosome  repo¬ 
sitioning.  Myc  directly  interacts  with  INIl/hSNF5,  a 
key  component  of  the  SWI/SNF  complex,  and  this 
was  suggested  to  stimulate  Myc  transcriptional  activity 

[48] .  INIl/hSNF5  appears  to  be  a  tumour  suppressor, 


and  is  mutated  in  the  majority  of  atypical  teratoid  and 
malignant  rhabdoid  tumours  [49-51].  More  recently, 
Myc  has  been  shown  to  interact  with  BAF53,  an  ac- 
tin-related  protein  which  is  another  integral  component 
of  the  SWI/SNF  chromatin  remodeling  complex.  Tar¬ 
geted  mutations  in  BAF53  inhibit  oncogenic  transfor¬ 
mation  by  Myc  [52].  In  that  context,  it  will  be 
important  to  determine  if  other  proteins  of  the  SWI/ 
SNF  complex  have  the  potential  to  interact  with  Myc 
and  whether  SWI/SNF  recruitment  may  play  a  role  in 
Myc-mediated  transformation.  Despite  this  enormous 
progress,  many  questions  remain.  Which  genes  are  tar¬ 
geted  by  each  of  these  distinct  complexes?  Does  each 
complex  drive  a  specific  genetic  program,  such  as  trans¬ 
formation  or  apoptosis?  Clearly,  the  precise  role  of  these 
ATPase/helicase  family  proteins  in  Myc  biology  requires 
further  study  and  this  knowledge  will  be  instructive  in 
determining  which  complex  is  of  highest  priority  for 
therapeutic  targeting  to  block  Myc  transforming 
function. 

2.3.  Interactions  with  other  transcriptional  regulators 

Whereas  TRRAP  appears  to  be  a  positive  regulator 
of  Myc-mediated  transformation,  Bin-1  (Bridging  pro¬ 
tein-1)  appears  to  be  a  negative  regulator  [53],  The 
mechanism  by  which  Bin-1  represses  transformation  is 
not  fully  understood,  but  it  includes  effects  on  the  cell 
cycle  as  well  as  the  promotion  of  apoptosis  in  response 
to  Myc  over-expression  [54].  Since  Bin-1  is  deleted  in  a 
variety  of  tumours  we  sought  to  better  define  its  interac¬ 
tion  with  Myc.  Our  group  has  used  nuclear  magnetic 
resonance  (NMR)  spectroscopy  and  biochemical  assays 
to  define  the  mechanism  of  interaction  between  Bin-1 
and  the  MYC  NTD  [55].  We  showed  that  a  small  pro- 
line-rich  peptide  within  the  conserved  MBI  interacts 
with  the  SH3  domain  of  Binl  and  that  this  interaction 
can  be  disrupted  by  phosphorylation  of  Myc  Ser62. 
Our  findings  raise  the  intriguing  possibility  that  the  con¬ 
served  MBI  region  may  bind  to  other  SH3  domain  pro¬ 
teins.  In  addition,  the  data  highlights  the  significance  of 
post-translational  modifications  to  Myc  activity  and 
suggests  their  modulation  as  an  opportunity  for  thera¬ 
peutic  intervention. 

Due  to  the  evidence  that  Myc  NTD  binding  proteins 
are  critical  to  Myc  function,  we  developed  a  novel  high 
throughput  screen  termed  the  repressed  transactivator 
assay  (RTA)  [56],  This  two  hybrid  approach  enables 
the  full  Myc  NTD  to  be  used  as  bait  in  the  context  of 
Myc  as  a  transactivator  bound  to  DNA.  Novel  meth¬ 
ods,  such  as  the  RTA,  will  greatly  facilitate  the  identifi¬ 
cation  of  Myc-binding  proteins  and  inhibitors  to  disrupt 
these  interactions.  Using  the  RTA  we  screened  a  library 
derived  from  medulloblastoma  cells  with  the  Myc  NTD 
as  bait.  A  novel  Myc  interactor,  JP02,  was  isolated  and 
shown  to  be  a  transcription  factor  containing  a  putative 
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LZ  and  a  ring  finger  domain.  JP02  is  closely  related  to  a 
Myc  transcriptional  target,  JPOl,  and  was  also  recently 
identified  as  a  novel  member  of  this  emerging  JPO  fam¬ 
ily  [57-59],  We  show  that  JP02  expression  affects 
anchorage  independent  growth  and  is  associated  with 
metastasis  in  medulloblastomas  [57],  As  we  have  also 
observed  Myc  interaction  with  JPOl  protein,  an  inter¬ 
esting  working  model  at  a  molecular  level  that  emerges, 
sees  Myc  inducing  JPOl  transcription  for  the  formation 
of  a  Myc-JPO  complex  as  a  positive  feedback  loop  to 
regulate  gene  transcription.  It  will  be  interesting  to 
determine  the  validity  of  this  model  for  the  JPO  family 
and  to  determine  if  additional  Myc  induced  genes  can 
also  form  functional  transcription  regulatory  complexes 
with  Myc. 

Another  novel  Myc  NTD  binding  protein  is  Krim-1, 
a  nuclear  zinc  finger  protein  that  contains  a  KRAB  do¬ 
main.  Krim-1  was  identified  using  the  Ras  Recruitment 
System  (RRS)  and  was  shown  to  associate  with  MBII 
[60,61].  In  a  reporter  assay,  Krim-1  was  able  to  nega¬ 
tively  regulate  Myc  transactivation  and  inhibit  its  onco¬ 
genic  activity  in  REFs,  a  phenotype  that  is  reduced  in 
the  presence  of  a  Myc-binding  mutant  of  Krim-1  [61]. 

Another  interactor  is  the  ARF  tumour  suppressor 
protein,  which  binds  directly  to  Myc  and  inhibits  its 
transcriptional  activity  in  a  p53-independent  fashion. 
ARF  blocks  Myc  ability  to  activate  transcription  with¬ 
out  affecting  its  ability  to  repress  transcription.  ARF 
prevents  Myc-induced  transformation  whereas  Myc- 
induced  apoptosis  remains  intact  even  in  the  absence 
of  p53.  ARF  was  shown  to  bind  both  Myc  NTD  and 
HLH-LZ  domains  of  Myc  [62,63].  These  findings  sug¬ 
gest  a  safeguard  mechanism  for  preventing  aberrant 
Myc  signalling.  The  question  naturally  follows:  does 
ARF  differentially  bind  to  a  cohort  of  Myc-target  genes 
related  to  proliferation  and/or  transformation?  Studies 
using  specific  inhibitors  of  the  Myc-ARF  interaction  will 
provide  a  direct  link  between  ARF  suppression  of  tumo- 
urigenesis  and  the  control  of  Myc  transcriptional  activ¬ 
ities.  These  findings  highlight  the  critical  role  of  MBII  in 
transcriptional  regulation  and  suggest  that  complexes, 
such  as  TRRAP,  may  either  compete  or  cooperate  with 
other  multiprotein  complexes  for  binding  with  ARF  or 
Krirnl. 

Recently,  several  additional  Myc-binding  proteins 
have  been  described,  but  their  role  in  transformation 
remains  unclear.  For  example,  Myc  and  PML  were 
shown  to  co-localise  within  discrete  nuclear  structures 
associated  with  the  nuclear  matrix,  termed  PML 
bodies  [64,65],  These  PML  bodies  associate  with  re¬ 
gions  of  high  transcriptional  activity  in  the  genome 
and  have  been  implicated  in  diverse  cellular  processes, 
although  their  specific  function  remains  open  [66].  The 
differential  effect  that  PML  appears  to  have  on  Myc 
activation  is  also  interesting.  In  PML-null  mouse 
embryonic  fibroblasts  (MEFs)  the  expression  levels 


of  numerous  Myc  target  genes  is  altered  [64,65]. 
Although  PML  was  demonstrated  to  physically  inter¬ 
act  with  the  Myc  NTD  and  is  recruited  at  Myc  regu¬ 
lated  promoters,  the  exact  mechanism  by  which  PML 
influences  Myc  activity  remains  unclear.  Further  in¬ 
sight  into  the  precise  contribution  of  these  interactors, 
as  well  as  others  such  as  PARP-10  [67]  or  human  pap¬ 
illomavirus  E6  [68],  to  the  function  of  oncogenic  Myc 
in  the  carcinogenic  process  will  determine  whether  tar¬ 
geting  these  proteins  may  be  of  merit. 


2.4.  Interactors  that  regulate  Myc  stability 

Multiple  mechanisms  ensure  proper  control  of  Myc 
activity  in  normal  cells,  including  regulation  of  Myc 
protein  turnover,  through  the  ubiquitin-proteosome 
pathway  [69,70].  Recently,  a  component  of  the  E3 
ubiquitin  ligase  complex,  Skp2,  has  been  reported  to 
mediate  Myc  turnover  in  vivo,  which  in  turn  is  linked 
to  Myc  transcriptional  activity  at  specific  target  genes. 
The  regions  of  Skp2-Myc  interaction  were  mapped  to 
MBII  as  well  as  to  Myc  CTD  (Fig.  2)  [71,72],  Consis¬ 
tent  with  its  role  as  a  co-factor  for  Myc,  Skp2  induces 
S-phase  entry  in  a  Myc-dependent  manner  [71],  Since 
Skp2  is  known  to  have  oncogenic  properties  [73,74]  its 
role  in  increasing  Myc  transcriptional  activity  is  not 
surprising.  Although  Skp2  is  required  for  the  phos¬ 
phorylation-dependent  degradation  of  various  pro¬ 
teins,  such  as  the  tumour  suppressor  FOXOl  [75], 
there  is  no  evidence  suggesting  the  effect  of  Skp2  on 
Myc  is  regulated  at  the  level  of  phosphorylation. 
Phosphorylation-dependent  degradation  of  Myc  in¬ 
volves  two  key  residues,  T58  and  S62  within  the 
MBI.  Phosphorylation  of  S62  is  mediated  by  Ras  sig¬ 
nalling  and  is  a  prerequisite  to  the  phosphorylation  of 
T58  that  is  performed  by  glycogen  synthase  kinase  3 
(GSK3)  [70,76-78].  Phosphorylation  at  T58  destabi¬ 
lises  Myc  protein  [70,79],  T58  represents  a  major  hot¬ 
spot  for  mutations  in  Burkitt’s  and  other  lymphomas 
[80].  T58-phosphorylated  Myc  is  specifically  bound 
by  the  F-box  protein,  Fbw7,  and  results  in  Myc  ubiq- 
uitination  and  degradation  [80-83].  Fbw7  appears  to 
function  as  a  tumour  suppressor  gene  [84],  Knock¬ 
down  of  Fbw7  increased  both  the  abundance  and 
transactivation  activity  of  endogenous  Myc.  Surpris¬ 
ingly,  one  of  the  Fbw7  isoforms  (Fbw7y)  co-localises 
with  Myc  in  the  nucleolus  upon  proteosome  inhibition 
suggesting  that  Myc  is  also  regulated  in  this  specia¬ 
lised  nuclear  compartment  for  degradation  [81],  These 
data  support  the  idea  that  Fbw7  and  Skp2  have 
opposing  effects  on  Myc  activity  by  targeting  MBI 
and  MBII,  respectively.  This  knowledge  can  be 
exploited  for  therapeutics  as  targeting  MBI  might  af¬ 
fect  Myc  stability  while  targeting  MBII  would  affect 
Myc  transcriptional  activity. 
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3.  Myc  regulated  target  genes 

To  understand  the  role  of  Myc  in  cellular  physiology 
and  pathology,  it  is  essential  to  identify  the  bona  fide  tar¬ 
get  genes  regulated  by  Myc.  A  bona  fide  Myc-target  gene 
is  one  whose  regulatory  region  is  bound,  directly  or  indi¬ 
rectly  by  Myc,  and  whose  expression  is  then  regulated 
under  an  appropriate  stimulus.  These  targets  are  distin¬ 
guished  from  the  many  regulatory  events  that  occur  as  a 
downstream  consequence  of  Myc  activity,  like  cell  cycle 
progression  [1 1],  In  the  past,  many  criteria  were  required 
to  distinguish  a  gene  as  a  true  Myc  target  gene,  but  none 
were  absolute.  Several  cDNA  microarray  studies  have 
identified  many  bona  fide  targets  and  downstream  regu¬ 
lated  genes,  but  rarely  have  the  two  subsets  been  clearly 
distinguished  [85-88].  Recently  developed  techniques 
have  revolutionised  this  major  issue  and  allowed  true 
target  genes  to  be  rapidly  identified  and  profiled 
in  vivo,  in  any  given  cell  or  tissue  under  any  given 
stimulus.  One  such  technique  is  termed  ChIP-chip  (or 
ChIP-on-chip),  in  which  the  sensitivity  and  specificity 
of  chromatin  innnunoprecipitation  (ChIP)  is  combined 
with  the  high  throughput  capability  of  microarray  (chip) 
technology. 

3.1.  Genome-wide  Myc  binding  profile 

Knowledge  of  binding  sites  found  within  a  genome  is 
essential  for  understanding  the  target  genes  regulated  by 
any  transcription  factor.  With  the  development  of  high 
throughput  ChIP  technologies,  many  labs,  including  our 
own,  have  identified  the  in  vivo  genomic  DNA-binding 
sites  of  Myc  [29].  This  is  especially  important  for  under¬ 
standing  the  mechanisms  of  Myc  role  in  carcinogene¬ 
sis.  By  profiling  the  many  target  genes  regulated  by 
Myc,  the  anticipation  is  that  a  key  subset  essential  for 
a  particular  biological  function  will  be  distinguished 
and  the  mechanism  of  co-regulation  of  this  cohort  of 
genes  will  then  be  determined.  Several  novel  insights 
have  been  forthcoming  from  these  studies  on  Myc  geno¬ 
mic  binding  in  mammals  [89-9 1  ].  It  has  been  concluded 
that  at  sites  of  transcriptional  activation,  Myc  and  Max 
bind  together  at  high  affinity  loci  containing  canonical 
or  non-canonical  E-boxes,  which  are  often  CpG  rich 
[29,92].  Moreover,  these  characteristics  are  evolution- 
arily  conserved  from  Drosophila  to  mammals  [93].  In 
addition,  recent  work  shows  that  there  is  evolutionary 
conservation  of  promoter  architecture  across  different 
species  of  Drosophila  containing  a  single  E-box  located 
within  the  first  100  nucleotides  downstream  of  the  tran¬ 
scription  start  site  [94],  These  outcomes  suggest  that  a 
subset  of  Myc  targets  share  a  common  and  particular 
mechanism  of  regulation.  By  contrast,  the  repression 
of  the  target  genes  by  Myc  does  not  occur  at  E-boxes, 
but,  rather  at  proximal  promoter  regions  [95-98].  Inter¬ 
estingly,  the  Myc-Max  interaction  is  essential  for  Myc 


to  repress  as  well  as  activate  gene  transcription  [91].  A 
systematic  analysis  across  promoters,  employing  short 
oligonucleotide  arrays,  showed  that  only  22%  of  the 
Myc  binding  sites  are  located  at  the  5'  upstream  region 
of  protein-coding  genes,  while  36%  are  placed  at  the  3' 
end  of  well-characterised  genes  that  are  associated  with 
non-coding  RNAs  [99],  In  light  of  this  study  it  is  very 
interesting  that  two  recent  papers  describe  the  relation¬ 
ship  between  Myc  and  non-coding  microRNAs  (miR- 
NAs).  The  first  shows  that  Myc  regulates  a  cluster  of 
miRNAs  in  chromosome  13  that  influences  the  major 
cell  cycle  regulator,  E2F1  [100],  The  authors  proposed 
that  Myc  directly  binds  to  this  miRNA  cluster  to  nega¬ 
tively  regulate  E2F1,  thereby  dampening  the  runaway 
effect  of  Myc  inducing  E2F1  transcription.  It  will  be 
interesting  to  determine  the  function  of  this  miRNA 
cluster  in  tumour  cells.  The  second  study  shows  that  an¬ 
other  cluster  in  chromosome  13,  which  is  often  overex¬ 
pressed  in  lymphomas,  strongly  cooperates  with  Myc 
in  lymphomagenesis  by  inhibiting  Myc  apoptotic  capa¬ 
bility  [101].  The  identification  of  miRNAs  is  in  its  in¬ 
fancy  and  the  identification  of  transcriptionally  active 
sites  by  ChIP-chip  will  greatly  aid  in  delineating  onco- 
genically  active  miRNAs. 

One  of  the  most  profound  results  emanating  from  the 
genome-wide  localisation  studies  is  that  Myc  binds  to  an 
enormous  number  of  target  genes,  compared  with  other 
transcription  factors  such  as  p53  and  Spl  [99].  More¬ 
over,  the  genomic  binding  sites  are  associated  with  genes 
whose  products  are  engaged  in  a  wide-range  of  biologi¬ 
cal  processes.  In  addition,  it  is  curious  that  not  all  Myc 
bound  targets  are  regulated  at  the  level  of  expression 
[89]. These  observations  have  been  gathered  from  a  lim¬ 
ited  series  of  experiments  conducted  in  a  variety  of  labs. 
Clearly,  it  will  be  critical  to  systematically  evaluate  the 
nature  of  Myc  target  genes  and  their  regulation  under 
a  multiplicity  of  physiological  and  pathological  condi¬ 
tions.  Importantly,  these  experiments  are  now  feasible 
thanks  to  the  advances  in  ChIP-chip  technologies.  To 
take  advantage  of  these  results,  it  will  be  necessary  to 
identify  Myc  interactors  that  cooperate  in  the  regulation 
of  genes  implicated  in  Myc  related  phenotypes.  The 
target  genes  identified  to  date  are  compiled  in  a  well- 
annotated  database  http://www.myc-cancer-gene.org/ 
site/mycTargetDB.asp  [11,102,103].  Due  to  space  con¬ 
straints,  we  will  only  discuss  specific  bona  fide  targets 
in  the  context  of  understanding  their  mechanism  of  reg¬ 
ulation  by  Myc. 

3.2.  Mechanisms  of  Myc  dependent  transcriptional 
activation 

Activation  of  target  genes  by  Myc  involves  at  least 
two  regulatory  steps;  chromatin  remodeling  and  pro¬ 
moter  clearance.  Mechanistic  analyses  of  Myc-induced 
genes  have  clearly  shown  that  Myc  participates  in  chro- 
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matin  remodeling  when  recruited  to  promoter  regions. 
For  example,  the  activation  of  the  normally  silent  telo- 
merase  reverse  transcriptase  gene  (TERT)  by  oncogenic 
Myc  in  exponentially  growing  fibroblasts  requires 
TRRAP  recruitment  and  is  accompanied  by  both  H3 
and  H4  acetylation  [104],  Despite  the  profound  role 
of  this  target  gene  in  oncogenesis,  transformation  of 
primary  cells  is  thought  to  require  the  regulation  of 
additional  targets  by  oncogenic  Myc,  as  TERT  overex¬ 
pression  alone  cannot  replace  Myc  in  Rat  fibroblast 
transformation  assays  [105]  and  mice  lacking  telomerase 
RNA  can  still  be  transformed  by  Myc  and  H-Ras  [106]. 
Many  cell  cycle  components  are  also  regulated  by  Myc 
in  a  MBII/TRRAP-dependent  manner  [107].  Myc  stim¬ 
ulates  expression  of  the  Cyclin  D2  and  Cdk4  genes,  lead¬ 
ing  to  sequestration  of  the  cell  cycle  inhibitor  p27  in 
CyclinD2/Cdk4  complexes  [108-110],  Remarkably,  p27 
is  then  degraded  by  the  protein  product  of  two  Myc  acti¬ 
vated  genes,  Cull  and  Cks  [109,111,112].  Evidence  sug¬ 
gests  that  the  interaction  of  Myc-TRRAP  to  recruit 
HAT  activity  to  target  gene  promoters  such  as  TERT 
and  Cyclin  D2,  is  important  for  Myc  transformation 
and  may  be  dispensable  for  Myc  to  drive  apoptosis 
[42,44].  For  example,  MEFs  derived  from  the  Cyclin 
D1/D2  double  knock-out  mice  showed  impaired  prolif¬ 
eration  in  response  to  ectopic  Myc  expression,  but  the 
ability  of  Myc  to  potentiate  apoptosis  remained  intact 
[113]. 

In  addition  to  directly  regulating  chromatin  remodel¬ 
ing  through  the  recruitment  of  HATs,  Myc  also  induces 
target  genes  involved  in  the  regulation  of  chromatin 
dynamics.  Recently,  two  such  target  genes  have  been  de¬ 
scribed.  MT-MC1  encodes  a  nuclear  protein  with  homol¬ 
ogy  to  certain  DNA  helicases,  and  HMG-I  encodes  one 
of  the  high-mobility  group  proteins.  Interestingly,  ecto¬ 
pic  expression  of  these  proteins  in  Myc  knock-out  rat 
fibroblasts  was  shown  to  reconstitute  many  Myc  pheno¬ 
types,  such  as  rescuing  the  parental  cell  morphology,  cor¬ 
rection  of  the  slow  growth  rate,  cell  size,  genomic 
instability,  clonogenicity,  tumourigenicity  and  the  regu¬ 
lation  of  a  subset  of  Myc  target  genes  [1 14],  Even  though 
these  two  genes  sometimes  differ  in  the  reconstitution  of 
Myc  functions,  significant  overlap  exists  between  them. 
Remarkably,  these  chromatin  remodeling  proteins, 
either  individually  or  in  combination,  were  not  able  to 
complement  the  ability  of  Myc  to  potentiate  apoptosis 
following  serum  withdrawal.  These  studies  leave  the 
door  open  to  a  selective  investigation  of  additional  fun¬ 
damental  Myc  targets,  which  can  restore,  for  example, 
the  Myc  apoptotic  function. 

Myc  has  been  shown  to  participate  in  the  transcrip¬ 
tion  regulatory  step  associated  with  promoter  clearance 
of  RNA  polymerase  II  (RNAP  II)  [115],  Myc  was  previ¬ 
ously  shown  to  recruit  components  of  the  elongation 
factor  P-TEFb  (positive  transcription  elongation  factor 
b)  at  the  cad  promoter  [115,116].  Inhibition  of  the  P- 


TEFb  complex  blocked  the  effects  of  Myc  in  trans¬ 
formed  cells  [117],  Thus,  the  rate-limiting  step  of  tran¬ 
scriptional  activation  of  specific  target  genes  may  be 
Myc-mediated  recruitment  of  P-TEFb,  which  then  al¬ 
lows  phosphorylation  of  RNAP  II  and  release  of  a  sus¬ 
pended  transcription  complex.  It  will  be  interesting  to 
determine  whether  a  cohort  of  genes,  like  cad,  is  simi¬ 
larly  regulated  by  Myc  and  involved  in  the  control  of 
a  common  biological  activity. 

An  attractive  new  development  in  understanding 
Myc  function  as  a  regulator  of  gene  transcription  is 
the  appreciation  that  Myc  not  only  regulates  RNAP  II 
regulated  genes,  but  also  affects  genes  transcribed  by 
RNAP  I  and  III.  In  fact,  the  various  components  of 
the  ribosomal  machinery  are  synthesised  by  all  three 
RNA  polymerases,  RNAP  I,  II  and  III  (Fig.  3).  This 
is  consistent  with  the  regulation  of  ribosomal  targets 
as  a  common  feature  of  many  expression  microarray 
analyses  to  identify  Myc  target  genes.  Ribosomal  bio¬ 
genesis  is  a  fundamental  cellular  process  that  takes  place 
in  the  nucleolus  and  is  essential  for  ribosome  assembly 
and  protein  synthesis.  Remarkably,  the  nucleoli  are  en¬ 
larged  in  cancer  cells  and  several  ribosomal  proteins  are 
overexpressed  in  tumours,  suggesting  a  correlation  be¬ 
tween  the  deregulation  of  protein  biosynthesis  and  can¬ 
cer,  leading  to  an  opportunity  for  the  development  of 
innovative  therapeutics  targeting  the  translation  ma¬ 
chine  [26].  Mechanistically,  Myc  binds  to  TFIIIB,  a 
component  of  RNAP  III  machinery,  and  stimulates 
RNAP  III  transcriptional  regulation  of  the  5S  rRNA 
gene  [118,119].  In  the  nucleoli,  Myc  directly  regulates 
RNAP  I  transcription  by  interacting  with  SL1  (TIF-IB), 
an  essential  complex  composed  of  the  TATA  binding  pro¬ 
tein  (TBP)  and  three  RNAP  I-specific  TBP-associated 
factors  (TAFs)  [120,121].  Additionally,  ChIP  experi¬ 
ments  show  that  Myc  and  Max  bind  at  non-canonical 
E-box  sequences  located  within  ribosomal  DNA  (rDNA) 
promoters.  This  association  is  followed  by  recruitment 
of  TRRAP,  enhanced  histone  acetylation,  recruitment 
of  RNAP  I,  and  activation  of  rDNA  transcription  [122- 
124],  Taken  together,  regulation  of  all  three  RNA  poly¬ 
merases  suggest  that  Myc  plays  a  key  role  within  the  cell 
to  produce  molecules  implicated  in  ribosome  biogenesis 
(Fig.  3). 

3.3.  Mechanisms  of  Myc  dependent  transcriptional 
repression 

The  molecular  mechanism  of  Myc  role  in  repression 
of  gene  transcription  remains  less  well  characterised 
than  its  role  in  activation,  yet  all  indications  suggest 
repression  is  as  important  as  activation  for  Myc  func¬ 
tion.  For  example,  structure-function  analyses  have 
linked  transformation  to  repression.  Moreover,  micro¬ 
array  and  ChIP-chip  analyses  show  that  Myc  activates 
and  represses  gene  expression  in  similar  proportions. 
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Fig.  3.  Myc  regulates  transcription  directed  by  the  three  RNA  polymerases.  Myc,  with  its  partner  Max,  binds  DNA  and  either  activates  or  represses 
transcription  of  numerous  target  genes  of  enormous  diversity,  which  are  regulated  by  RNA  polymerase  II  (RNAP  II)  and  implicated  in  a  multiplicity 
of  cellular  functions.  In  the  nucleolus,  together  with  Max  as  a  partner,  Myc  activates  the  transcription  of  the  rRNA  genes,  regulated  by  the  RNA 
polymerase  I  complex  (RNAP  I).  As  well,  Myc  activates  rRNA  5S  transcription  by  interacting  with  TFIIIB,  a  component  of  the  RNA  polymerase  III 
complex  (RNAP  III).  The  collective  regulation  of  these  three  transcriptional  functions  by  Myc  suggests  that  Myc  plays  a  key  role  within  the  cell  to 
produce  molecules  implicated  in  ribosome  biogenesis.  RNAP  I,  II  and  III  exist  as  a  part  of  multiprotein  complexes:  other  components  are  not  shown 
in  this  simplified  figure.  The  hexagon  symbolizes  various  Myc-protein  interactors  (I)  that  contribute  to  gene  regulation.  (Figure  adapted  from 
Oskarsson  and  Trumpp  [199]). 


The  target  genes  repressed  by  Myc  fall  into  several  func¬ 
tional  categories.  At  a  molecular  level  the  best  character¬ 
ised  targets  includes  those  whose  products  inhibit  cell 
proliferation  (e.g.  p  1 5,  p21,  gadd45).  Characterising 
the  mechanism  of  Myc  repression  of  these  and  other  re¬ 
pressed  target  genes  has  shown  that  Myc  does  not  ap¬ 
pear  to  directly  associate  with  the  regulatory  regions 
of  repressed  targets,  but  rather  is  recruited  to  core 
promoters  through  protein-protein  interactions.  Myc 
interacts  with  activating  transcription  factors,  such  as 
TFII-I,  NF-Y  and  Miz-1  [95,125,126],  For  example,  tran¬ 
scriptional  activation  by  Mizl  is  abolished  with  Myc 
binding,  and  the  Myc-Miz-1  complex  acts  as  a  transcrip¬ 
tional  repressor;  in  part  due  to  competition  between  p300 
and  Myc  for  binding  to  Mizl  [98].  It  was  recently  shown 
that  Myc  represses  transcription  of  p21Cip  through 
recruitment  of  the  DNA  methyltransferase  corepressor 
Dnmt3a.  Myc  and  Dnmt3a  form  a  ternary  complex  with 
Miz-1  to  corepress  p21Cip  [127],  The  precise  role  of 
Miz-1,  as  well  as  other  factors  including  YY1  [128,129], 
NF-Y  [95,130]  and  TFII-I  [126]  in  "Myc  repression 
remains  unclear  and  will  require  additional  analyses. 

Like  Myc  activation  of  gene  transcription,  Myc 
repression  can  occur  through  multiple  mechanisms. 
Myc  repression  was  thought  to  be  dependent  upon  an 
initiator  (Inr)  region  within  the  promoter,  however, 


the  presence  of  an  Inr  is  not  essential  as  genes  lacking 
an  Inr,  such  as  gadd45  and  PDGFB,  are  repressed  by 
Myc  [95,131].  Myc  has  been  shown  to  repress  gadd45 
by  a  post-RNA  polymerase  II  recruitment  mechanism 
[132].  Interestingly,  promoter  binding  and  repression 
of  PDGFBR  are  separable  activities,  since  mutant 
Myc  proteins  that  are  unable  to  repress  PDGFBR  gene 
expression,  still  bind  to  the  promoter  in  vivo  [130]. 

Currently  a  large  number  of  in  vivo  Myc  activated  or 
repressed  target  genes  have  been  compiled  from  studies 
in  different  types  of  cancer  cells.  However,  many  ques¬ 
tions  remain.  For  example,  which  target  genes  are  regu¬ 
lated  by  Myc  and  critical  for  the  carcinogenic  process? 
This  issue  will  be  essential  to  further  study  the  subset 
of  Myc  regulated  genes  that  directly  or  indirectly  con¬ 
tribute  to  the  tumour  formation.  Through  the  character¬ 
isation  of  these  target  genes,  new  possibilities  will  be 
addressed  in  the  diagnostic  field  as  well  as  in  the  devel¬ 
opment  of  novel  anticancer  therapeutics  to  target  the 
oncogenic  activity  of  Myc. 

4.  Myc  as  a  target  in  cancer  therapy 

Developing  therapeutics  to  inhibit  oncogenic  Myc 
would  have  enormous  impact  on  the  treatment  of  a 
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wide-range  of  human  cancers.  Transgenic  mouse  models 
provide  a  glimpse  of  how  profoundly  effective  blocking 
Myc  can  be  as  an  anti-cancer  target  in  vivo.  A  series  of 
elegant  experiments  using  inducible  Myc  in  hematopo- 
etic  cells  [133],  mammary  gland  [134],  liver  [135],  skin 
[135,136]  and  pancreatic  islets  [137]  have  demonstrated 
that  induction  of  oncogenic  Myc  leads  to  full-blown 
malignancies,  while  blocking  Myc  activation  in  most 
cases  results  in  tumour  regression  [138].  This  strongly 
supports  the  notion  that  targeting  Myc  in  tumours  rep¬ 
resents  a  valid  therapeutic  approach.  Caution  should  be 
exercised  as  not  all  tumours  regressed  upon  withdrawal 
of  Myc  and  a  small  proportion  proceeded  to  hematolog¬ 
ical  malignancies  [133],  while  half  of  Myc-induced 
mammary  carcinomas  acquired  mutations  in  K-Ras  or 
H-Ras,  thus  rendering  them  independent  of  Myc  status 
[134].  It  is  likely  that  anti-Myc  agents  would  have  to  be 
coupled  with  therapies  targeting  Ras  or  other  oncogenic 
pathways. 

Many  strategies  are  under  development  to  target  or 
exploit  oncogenic  Myc  in  tumour  cells  and  eradicate 
the  malignant  cellular  clones  (Fig.  4).  One  approach  is 
to  disrupt  Myc  expression  by  targeting  regulatory  steps 
ranging  from  transcription  to  translation.  Another  strat¬ 
egy  is  to  block  Myc  function  by  inhibiting  critical  pro¬ 
tein-protein  interactions  that  are  essential  for  Myc  to 
regulate  gene  transcription,  such  as  heterodimerisation 
with  Max.  The  working  assumption  is  that  targeting 
the  Myc  regulatory  network  will  trigger  the  lethality  of 


the  tumour  cell  without  causing  irreversible  damage  to 
neighbouring  normal  cells.  Such  a  tumour-specific  effect 
is  evident  with  other  inhibitors  that  target  universally 
expressed  oncogenes.  For  example,  antisense  bcl-2  will 
sensitise  tumour  cells  to  undergo  apoptosis  in  response 
to  low-dose  chemotherapy,  but  non-transformed  cells 
are  spared  [139-142],  In  this  case,  the  tumour  cell  has 
become  dependent  upon  the  deregulated  signalling  path¬ 
way  and  even  marginal  down-regulation  of  this  lifeline 
renders  it  susceptible  to  extinction.  Should  this  assump¬ 
tion  be  incorrect,  Myc  inhibitors  will  have  to  be  targeted 
by  tumour-specific  delivery  mechanisms.  Newer  strate¬ 
gies  aim  to  achieve  a  high  therapeutic  index  by  targeting 
tumour-specific  Myc-protein  interactions  and/or  gene 
regulatory  functions.  Yet  others  aim  to  exploit  the  pres¬ 
ence  of  oncogenic  Myc  expression  in  tumour  cells  to  spe¬ 
cifically  trigger  a  suicide  response  exclusively  in  these 
transformed  cells.  Both  ongoing  and  emerging  strategies 
will  be  further  discussed  below. 

4.1.  Targeting  Myc  expression 

One  of  the  first  successful  applications  of  antisense 
technology  targeted  Myc  expression,  and  through 
advancements  over  the  last  15  years,  this  approach  re¬ 
mains  at  the  forefront  of  anti-Myc  therapeutics  [8,143], 
Antisense  oligonucleotides  (ASOs)  are  short  single- 
stranded  DNA  molecules  that  specifically  target,  hybrid¬ 
ize  and  inhibit  the  mRNA  of  a  selected  gene  (Fig.  4). 


TFO*  TMPyP4  ★  * 
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■  myc  — 


Targeting  Myc:Max  inhibition  of 

interactions*  disruptors**  targets** 


Activity  has  been  shown: 
1c  In  vitro 

★  ★  In  animal  models 

★  ★★  Phase  I  clinical  trials 

☆  Not  tested 


Fig.  4.  Strategies  for  targeting  oncogenic  Myc  in  cancer.  Therapeutic  agents  are  coded  according  to  their  current  therapeutic  stage  of  development. 
Agents  that  have  demonstrated  efficacy  in  vitro  are  indicated  with  one  star.  These  include  inhibitors  that  block  Myc  expression,  such  as  triple  helix 
forming  oligonucleotides  (TFO).  Double  star  agents  have  been  tested  in  animal  tumour  models  or  xenograft  models  for  their  efficacy  to  inhibit  Myc. 
This  category  includes  disruptors  of  Myc-Max  interaction  and  agents  that  block  Myc  target  gene  function,  such  as  cationic  porphyrins  (TMPyP4) 
and  small  interfering  RNAs  (siRNA).  Some  of  the  antisense  oligonucleotides  (ASO)  that  have  successfully  completed  Phase  I  clinical  trials,  and  are  at 
advanced  stage  of  drug  development  are  indicated  by  three  stars.  Finally,  a  clear  star  indicates  additional  potential  targets  for  the  therapeutic 
interference  of  Myc-protein  interactors  (I). 
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The  DNA-RNA  duplex  then  recruits  RNase  H  endonu¬ 
clease  to  cleave  the  RNA  strand  in  the  duplex  [144], 
leaving  the  antisense  DNA  intact  to  hybridize  to  addi¬ 
tional  mRNAs  of  the  target  gene  [145,146].  Antisense 
technology  has  evolved  as  a  viable  therapeutic  alterna¬ 
tive  by  increasing  the  functional  stability  and  permeabil¬ 
ity  of  the  ASOs.  This  has  been  largely  achieved  by 
replacing  the  phosphodiester  backbone  with  a  nuclease 
resistant  phosphorothioate  linkage  (PS  ASOs)  [147,148]. 

Myc  ASOs  have  advanced  successfully  through  the 
many  stages  of  preclinical  evaluation  and  have  demon¬ 
strated  anti-cancer  potential.  Evaluation  in  vitro  has 
shown  that  downregulating  Myc  expression  by  PS  ASOs 
reduces  leukaemic  cell  proliferation,  induces  differentia¬ 
tion  and  inhibits  Gl/S  progression  [149,150].  Moreover, 
exposure  of  MCF-7  breast  cancer  cells  to  Myc  ASOs  in¬ 
duced  growth  arrest  [151].  In  mouse  models,  treatment 
with  Myc  PS  ASOs  resulted  in  the  delay  or  prevention 
of  Burkitt’s  lymphoma  [152-154]  and  Myc  ASOs  en¬ 
hanced  the  efficacy  of  cisplatin  to  target  melanoma  both 
in  vitro  and  in  vivo  [155].  Cisplatin  resistance  can  be 
overcome  by  inhibiting  Myc  using  AVI-4126,  a  phosp- 
horodiamidate  morpholino  oligomer  (PMO),  in  the  Le¬ 
wis  lung  carcinoma  model  [156].  AVI-4126  also 
inhibited  growth  of  a  murine  prostate  cancer  xenograft 
by  inducing  growth  inhibition  and  apoptosis  in  vivo 
[157].  These  results  have  led  to  Phase  I  clinical  trials, 
which  show  that  intravenous  administration  of  the  mor¬ 
pholino  oligomer  was  not  accompanied  by  toxicity  or 
serious  adverse  events,  and  importantly,  the  bioavail¬ 
ability  was  measurable  in  malignant  tumours.  The  num¬ 
ber  of  patients  studied  remains  modest,  yet  this  data 
supports  the  use  of  the  AVI-4126  PMO  as  a  potential 
therapeutic  for  cancer  treatment  [158].  It  will  be  interest¬ 
ing  to  monitor  the  efficacy  of  this  agent  as  it  advances 
through  to  Phase  II/III  clinical  trials. 

Several  additional  Myc  ASOs  show  promise  for  fu¬ 
ture  application  as  anti-cancer  agents.  Administration 
of  INX-6295,  a  16-mer  Myc  PS  ASO  encapsulated  in  li¬ 
pid  particles,  shows  antitumour  efficacy  against  a  human 
melanoma  xenograft.  When  administered  with  cisplatin, 
INX-6295  produced  complete  tumour  regression  in  30% 
of  treated  mice  [159].  Furthermore,  the  combined  appli¬ 
cation  of  bcl-2  ASO/cisplatin/INX-6295  in  mice  har¬ 
bouring  human  melanoma  xenografts  overexpressing 
either  bcl-2  or  c-myc  oncogenes  resulted  in  effective 
antitumour  therapy  [160],  Yet  another  agent,  the  novel 
psoralen-  or  acridine-modified,  clamp-forming  ASOs, 
can  downregulate  Myc  expression  and  synergise  with 
cisplatin  to  inhibit  melanoma  cell  proliferation  and  tu¬ 
mour  progression  [161,162].  The  peptide  nucleic  acid 
(PNA)  ASO  also  shows  promise.  PNA  is  a  synthetic 
DNA  in  which  the  sugar-phosphate  backbone  is 
replaced  with  a  polyamide-(2-aminoethyl)  glycine- 
skeleton.  This  modification  provides  PNAs  with  a  long 
half-life  and  enables  PNAs  to  specifically  hybridise  to 


DNA  and/or  RNA  in  a  complementary  manner,  form¬ 
ing  a  strong  and  effective  duplex  that  can  inhibit  tran¬ 
scription  and  translation  of  the  target  gene  [163],  Myc 
expression  was  rapidly  downregulated  in  Burkitt’s  Lym¬ 
phoma,  using  a  17-mer  anti -c-myc  PNA  covalently 
linked  to  a  nuclear  localisation  signal  (NLS)  (PNA- 
mycwt-NLS)  [164].  However,  because  this  PNA  blocked 
Myc  in  both  transformed  and  non-transformed  cells, 
further  innovative  refinements  of  this  strategy  are  under 
development  to  increase  tumour  specificity  [165]. 
Clearly,  ASO  technology  targeting  Myc  expression  is 
well  advanced  and  has  enormous  promise  for  future 
application  to  patient  care.  Maximal  efficacy  will  depend 
on  achieving  synergy  with  conventional  chemotherapies 
and/or  novel  molecular  anti-cancer  agents  [166,167], 

RNA  interference  (RNAi)  is  a  modern  and  popular 
approach  to  knockdown  gene  expression  that  has  poten¬ 
tial  for  drug  development.  RNAi  is  a  mechanism  for 
silencing  gene  expression  through  targeting  double- 
stranded  RNA  to  mRNA  resulting  in  degradation  of 
the  targeted  mRNA  (Fig.  4).  In  mammalian  cells,  long 
double-strand  RNAs  are  cleaved  into  small  interfering 
RNAs  (siRNAs)  that,  through  incorporation  into  the 
silencing  complex,  direct  target  recognition.  RNAi  has 
been  touted  as  the  next  major  tool  in  targeted  cancer 
therapy,  because  of  its  impressive  specificity  and  effi¬ 
cacy.  Compared  to  antisense  approaches,  siRNAs  are 
1000-fold  more  active  [168]  and  a  plethora  of  data  shows 
the  efficacy  of  siRN  As  in  cell  culture.  For  example,  Myc 
siRNA  effectively  inhibited  Myc  protein  levels  in  MCF7 
cells  [169].  It  appears  that  siRNA  can  be  effective  in  vivo 
[170],  but  this  has  not  yet  been  thoroughly  explored.  The 
primary  limitations  of  siRNA  in  vivo  remain  its  stability 
and  delivery.  siRNA  is  quickly  degraded  in  plasma,  so 
the  duplex  will  need  to  be  chemically  modified  for  use 
as  a  drug.  Perhaps  strategies  used  to  modify  ASOs  to  in¬ 
crease  stability  can  be  exploited  for  siRNA.  Recently  the 
issue  of  delivery  was  addressed  by  fusing  multiple  siR¬ 
NAs,  targeting  Myc,  Hdm2  and  VEGF  mRNAs,  to  a 
positively  charged  protein  that  was  covalently  linked 
to  a  specific  antibody.  This  ensured  that  only  melanoma 
cells  ectopically  expressing  that  particular  ligand  were 
growth  inhibited  in  vivo  [171].  With  this  novel  approach 
it  may  be  possible  to  deliver  such  a  lethal  siRNA  cock¬ 
tail  through  a  Myc-induced  cell  surface  molecule  to 
achieve  tumour-specific  targeting.  Whether  siRNAs  tar¬ 
get  mutated  oncogenes  like  Ras  [172]  and/or  deregulated 
oncogenes  like  Myc  [169],  these  new  tools  are  likely  to 
make  a  firm  and  lasting  entrance  into  the  arsenal  of  ther¬ 
apeutics  in  the  fight  against  cancer. 

Another  series  of  agents  have  been  designed  to  target 
myc  expression  at  the  transcriptional  level.  One  promis¬ 
ing  agent  includes  the  triple  helix  forming  oligonucleo¬ 
tides  (TFOs)  which  bind  to  double-stand  purine-rich 
DNA  within  promoter  regions  and  block  transcription 
factor  binding  (Fig.  4).  Phosphothioate  stabilised  TFOs 
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directed  against  the  promoter  region  of  c -myc  have  been 
shown  to  successfully  inhibit  Myc  expression  in  several 
leukaemia  and  lymphoma  cell  lines  leading  to  cellular 
growth  arrest  and  apoptosis  [173,174].  TFOs  conjugated 
to  the  DNA-intercalating  agent  daunomycin  specifically 
downregulated  endogenous  Myc  in  prostate  and  breast 
cancer  cell  lines  [175],  Cationic  porphyrin  TMPyP4, 
which  inhibits  Myc  transcription  by  blocking  G  quadru- 
plexes,  a  DNA  structure  formed  from  G  rich  single¬ 
strand  DNA  during  active  transcription  is  another 
approach  to  target  Myc  [176]  (Fig.  4).  TMPyP4  inhibited 
the  in  vitro  transcription  of  myc  and  decreased  tumour 
growth  rates  in  xenograft  models  [176],  The  original  use 
of  cationic  porphyrin  was  to  block  telomerase  by  stabilis¬ 
ing  the  telomeric  G  rich  single  stand  DNA  overhangs  into 
G  quadruplexes  [176],  New  Myc-specific  porphyrin 
analogues  show  promising  results,  suggesting  that  this 
approach  has  merit  for  further  development  [177,178]. 
Introduction  of  G  quadruplex-forming  oligonucleotides 
into  Burkitt’s  lymphoma  cells  resulted  in  growth  inhibi¬ 
tion  by  sequestering  the  factors  that  normally  bind  to 
the  native  G  quadruplexes  formed  at  the  Myc  promoter 

[179] .  It  will  be  interesting  to  monitor  the  development 
of  these  various  agents  that  target  myc  transcription  for 
their  ultimate  use  as  therapeutics. 

4.2.  Targeting  Myc-Max  interaction 

Breaking  the  Myc-Max  bond  would  clearly  destroy 
oncogenic  Myc  function  and  several  strategies  to  dis¬ 
solve  this  partnership  are  well  underway  (Fig.  4).  Many 
of  the  approaches  under  development  have  been  ad¬ 
vanced  because  of  our  knowledge  of  the  detailed  struc¬ 
tural  biology  of  this  protein-protein  interaction  and 
the  essential  residues  involved.  Molecular  modeling 
and  mutagenesis  have  been  used  to  identify  specific 
amino  acids  that  alter  the  specificity  of  dimerisation 

[180] .  Four  amino  acids  within  the  LZ  of  Myc  were  suf¬ 
ficient  for  homodimerisation,  as  well  as  heterodimerisa- 
tion  with  Myc  and  Max.  Furthermore,  a  mutant  protein 
termed  Omomyc,  which  interferes  with  Myc  binding  to 
E-box  elements,  was  able  to  inhibit  colony  formation 
in  NIH3T3  cells  [181],  Introduction  of  Omomyc  in  a 
Myc-induced  skin  tumourigenesis  model  in  mice  [182] 
inhibited  Myc-induced  papillomatosis,  as  well  as  re¬ 
stored  the  normal  keratinocyte  differentiation  program 
and  skin  architecture,  both  of  which  are  otherwise  dis¬ 
rupted  by  Myc  activation  [183].  Importantly,  the  posi¬ 
tive  outcome  of  Omomyc  expression  was  completely 
dependent  on  the  presence  of  an  oncogenic  Myc  and 
therefore  may  be  of  therapeutic  value  for  the  specific  tar¬ 
geting  of  Myc-deregulated  cells  without  affecting  the 
surrounding  normal  cells. 

Knowing  the  structural  domains  required  for  the 
Myc-Max  interaction  inspired  a  dominant-negative  ap¬ 
proach  to  disrupt  this  linkage.  Myc  mutants  expressing 


only  the  b-HLH-LZ  or  HLH-LZ  domain  rapidly  in¬ 
duced  apoptosis  in  3T3-L1  mouse  fibroblasts  [184].  This 
study  provided  the  rationale  to  devise  additional  strate¬ 
gies  to  directly  exploit  Myc  heterodimerisation  for  the 
design  of  novel  therapeutics.  One  of  the  major  chal¬ 
lenges  with  this  approach  lies  in  achieving  efficient  deliv¬ 
ery  of  the  drug  to  the  nucleus  of  tumour  cells  in  vivo.  To 
this  end,  a  mutant  peptide  derived  from  the  helix- 1  re¬ 
gion  of  Myc  was  linked  to  an  internalisation  sequence 
[185].  The  fusion  peptide  interfered  with  the  transcrip¬ 
tional  activity  of  Myc  leading  to  the  inhibition  of 
MCF-7  cell  growth.  The  stability  and  activity  of  this 
peptidomimetic  molecule  was  increased  [186,187]  and  a 
variant  of  the  original  peptidomimetic  has  been  synthes¬ 
ised  and  tested  in  mouse  models.  Interestingly,  analysis 
of  inhibitor  interactions  of  Myc-Max  shows  the  active 
molecules  act  through  key  basic  residues  at  the  outer 
surface  of  the  Myc-Max  heterodimer,  potentially  by 
binding  or  interfering  with  another  interactor.  The  novel 
peptides  were  capable  of  reaching  high  concentrations  in 
mouse  organs  and  were  effective  at  inhibiting  growth  of 
a  colon  cancer  cell  line.  This  peptide  is  10-fold  larger 
than  traditional  small  molecules,  which  may  contribute 
to  its  highly  selective  interference  with  Myc-specific  pro¬ 
tein-protein  interactions  [186].  It  would  be  of  great 
interest  to  test  which  specific  Myc-protein  interaction 
is  inhibited  in  vivo  and  whether  knockdown  of  this  co¬ 
factor  can  directly  trigger  an  anti-tumour  effect  in 
animals. 

Recently,  two  groups  have  shown  that  inhibitors 
blocking  the  Myc-Max  interaction  can  be  isolated  using 
a  high  throughput  screen.  In  one  study,  inhibitors  were 
identified  by  fluorescence  resonance  energy  transfer  in 
high-throughput  screens  of  peptidomimetic  libraries, 
then  confirmed  by  enzyme-linked  immunosorbent  assay 
and  electrophoretic  mobility  shift  assay.  The  antagonists 
interfered  with  Myc-induced  and  Jun-induced  oncogenic 
transformation  suggesting  the  inhibitor  may  also  target 
the  LZ  of  the  Jun  oncoprotein  [188].  The  exact  molecu¬ 
lar  mechanism  of  inhibition  and  the  utility  of  these 
agents  in  the  control  of  carcinogenesis  is  the  focus  of  fu¬ 
ture  analysis.  In  the  second  study,  the  yeast  two-hybrid 
assay  was  used  to  screen  a  library  of  10000  compounds 
to  identify  those  able  to  disrupt  the  interaction  of  the  b- 
HLH-LZ  regions  of  Myc  and  Max,  yet  show  no  general 
toxicity  to  the  yeast  [189].  Several  compounds  were  iden¬ 
tified  and  their  ability  to  inhibit  the  Myc-Max  interac¬ 
tion  confirmed  by  an  in  vitro  association  assay.  Using 
a  reporter  assay,  the  compounds  were  shown  to  inhibit 
Myc  transcriptional  activity  and  proved  to  inhibit 
growth  of  Myc  transformed  rat  fibroblasts,  but  not 
Myc-null  cells.  Finally,  incubation  of  transformed  cells 
with  the  compounds  for  three  days,  prior  to  their  injec¬ 
tion  into  nude  mice,  inhibited  tumour  growth  in  vivo. 
Although  the  compounds  used  by  both  groups  require 
relatively  high  concentrations,  these  promising  studies 
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provide  a  platform  for  future  development  of  more 
effective  small  molecule  inhibitors  of  Myc-Max 
dimerisation. 


5.  Conclusions  and  perspectives 

Targeting  Myc  at  the  level  of  expression  and/or  func¬ 
tion  is  an  effective  approach  to  eliminate  this  potent 
oncoprotein.  The  agents  described  in  this  review  success¬ 
fully  block  Myc  and  have  passed  the  first  hurdle  in  tar¬ 
geted  drug  design.  The  issues  that  remain  to  be  resolved 
include,  but  are  not  limited  to,  evaluating  and  improving 
sensitivity,  specificity,  delivery  and  efficacy  as  a  single 
agent  and  in  combination  with  other  anti-neoplastic 
therapies.  The  agents  that  have  advanced  beyond  Phase 
I  clinical  trials  show  enormous  promise.  It  will  be  fasci¬ 
nating  to  monitor  their  progress  and  learn  whether  tar¬ 
geting  Myc  will  have  the  expected  impact  at  the  level  of 
patient  care. 

There  are  several  additional  approaches  aimed  at  tar¬ 
geting  Myc  activity  that  are  in  the  earliest  stages  of 
development.  Based  on  the  success  of  targeting  Myc- 
Max  interaction,  there  is  significant  interest  in  targeting 
other  key  Myc-protein  partnerships,  such  as  Myc- 
TRRAP.  This  field  of  study  covers  a  broad  spectrum 
of  experimental  analyses,  from  first  identifying  the  inter¬ 
actor,  determining  whether  it  plays  an  important  func¬ 
tional  role  in  Myc  transformation,  through  to 
mapping  the  precise  points  of  interaction.  Many  Myc- 
protein  interactions  highlighted  here  have  been  de¬ 
scribed  only  in  recent  years  and  some  of  them  may 
warrant  inhibitor  development. 

A  second  major  area  of  fundamental  research  that 
has  the  potential  to  impact  therapeutic  design  aims  at 
identifying  and  understanding  Myc  target  genes  and 
the  biological  pathways  they  regulate.  As  a  multifunc¬ 
tional  master  regulator,  Myc  induces  several  genes 
which,  in  turn,  play  a  critical  role  in  transformation 
and  are  now  being  targeted  as  potential  anti-cancer  ther¬ 
apeutics.  For  example,  haplo  insufficiency  for  a  Myc  tar¬ 
get  gene,  ode  reduces  skin  tumourigenesis  in  mice  [190] 
and  a  specific  inhibitor  of  ODC,  2-difluoromethylorni- 
tive,  can  block  Myc-induced  oncogenesis  [191].  Clearly, 
a  greater  understanding  of  the  Myc  transformation  pro¬ 
gram  will  result  in  additional  opportunities  to  target 
Myc  function  in  tumour  cells. 

Another  novel  approach  does  not  target  Myc  directly, 
but  instead  aims  to  exploit  the  ability  of  oncogenic  Myc 
to  potentiate  apoptosis.  A  cooperating  lesion,  such  as 
Bcl-2  activation,  often  inhibits  Myc  potentiation  of 
apoptosis  and  collaborates  with  Myc  to  drive  transfor¬ 
mation.  Blocking  these  anti-apoptotic  molecules  may  re¬ 
lease  the  ability  of  Myc  to  sensitise  tumour  cells  to 
undergo  apoptosis,  elevate  the  therapeutic  index  and 
achieve  tumour  cell  death.  There  is  enormous  effort  to 


understand  the  nature  of  the  genetic  abnormalities  asso¬ 
ciated  with  cancers  that  can  block  Myc-potentiation  of 
apoptosis  and  contribute  to  transformation.  With  this 
knowledge,  the  innovative  approach  of  exploiting  dereg¬ 
ulated  Myc  can  be  further  advanced. 

Myc  deregulation  is  often  associated  with  aggressive 
disease  of  poor  prognosis,  which  augments  the  urgency 
for  novel  therapeutics  targeting  this  potent  oncoprotein. 
Many  such  agents  are  well  along  the  drug  development 
pipeline.  Moreover,  fundamental  research  instructs  us 
daily  of  new  opportunities  to  effectively  target  Myc 
expression  and  function  to  block  malignant  transforma¬ 
tion.  Further  growth  in  the  area  of  anti-Myc  therapeu¬ 
tics  is  warranted  and  anticipated. 
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Abstract 

The  N-terminal  domain  (NTD)  of  cMyc  interacts  with  specific  co-factors  to  regulate  cell  cycle, 
apoptosis  and  tumorigenesis.  TRRAP  is  one  such  cofactor  that  was  isolated  biochemically  by 
virtue  of  its  interaction  with  the  cMyc  NTD.  Here  we  used  the  Repressed  Transactivator  (RTA) 
assay  to  identify  a  region  of  TRRAP  (residues  1665-2030)  that  interacts  with  the  cMyc  NTD. 
Using  multiple  experimental  approaches,  we  have  further  defined  a  region  of  TRRAP  (residues 
1690-1850)  which  is  sufficient  to  bind  to  a  functionally  critical  region  of  cMyc  (residues  120- 
160),  in  a  W135  dependent  manner  both  in  vitro  and  in  vivo ,  establishing  the  essential 
components  of  this  critical  interaction. 


Keywords:  TRRAP,  cMyc,  pulldown  assay,  circular  dichroism,  Repressed  Transactivator,  co- 
immunoprecipitation 
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Introduction 

Deregulated  expression  of  the  c-myc  oncogene  is  a  common  feature  of  numerous  human  cancers  of 
diverse  origin  and  is  often  associated  with  aggressive  disease.  The  product  of  the  human  c-myc  gene,  cMyc, 
is  a  nuclear  phosphoprotein  that  regulates  a  wide  spectrum  of  biological  activities  including  cell 
proliferation,  differentiation  and  programmed  cell  death.  The  C-terminal  domain  of  cMyc  consists  of  helix 
loop  helix  (HLH)  and  leucine  zipper  (LZ)  motifs,  which  are  important  for  cMyc  to  bind  to  its  obligate 
partner,  Max.  The  mechanism  of  cMyc:Max  interaction  with  DNA  has  been  well  characterized,  in  part 
because  the  structure  of  the  C-terminal  has  been  solved.[l]  By  contrast,  the  biophysical  characteristics  of  the 
cMyc  NTD,  alone  and  in  association  with  its  partner  proteins  has  remained  elusive,  despite  the  functional 
importance  of  this  region  for  all  known  biological  activities  of  cMyc. 

The  NTD  of  cMyc  spans  amino  acids  1  to  262  and  contains  three  regions  that  are  highly  conserved 
amongst  all  members  of  the  Myc  family  of  proteins  including  Myc  box  I  (MBI,  residues  45-  63),  MBII 
(128-143)  and  MBIII  (188- 199). [2]  The  transactivation  domain  (TAD),  residues  1  to  143,  is  sufficient  to 
activate  gene  transcription  when  linked  to  a  heterologous  DNA-binding  domain.  [3]  Analysis  of  the  cMyc 
TAD  by  CD  has  shown  that  residues  1-143  harbor  little  or  no  secondary  structural  content.  [4]  Interestingly, 
structure  is  evident  upon  interaction  of  this  region  with  the  basal  transcription  factor,  TATA-binding  protein 
(TBP).[4]  Recent  analysis  of  a  larger  fragment  of  the  cMyc  NTD,  residues  1-167,  which  includes  the 
unstructured  cMyc  TAD,  identified  a  partially  helical  region  and  demonstrated  features  of  a  molten  globule 
upon  specific  binding  to  Myc  interactors  TBP  and  MM-1.[5]  Thus,  evidence  suggests  the  structure  and 
function  of  the  cMyc  NTD  is  highly  dependent  upon  interaction  with  its  partner  proteins  and  operates  on  an 
induced  fit  model  of  interaction.  [6] 

One  of  the  most  intriguing  Myc-binding  proteins  yet  described  is  the  large  (-430  KDa)  ATM/PI3- 
kinase-related  protein  termed,  transformation-transactivation  domain-associated  protein  (TRRAP). 
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Inhibition  of  TRRAP  expression  or  function  blocks  Myc-mediated  oncogenesis, [7]  establishing  an  essential 
role  for  TRRAP  in  cMyc  activity.  Indeed,  the  activation  and  oncogenic  potency  of  the  individual  Myc 
family  members  was  directly  proportional  to  their  ability  to  bind  TRRAP.  [8]  Moreover  the  interaction 
between  cMyc  and  TRRAP  has  been  shown  in  vivo  to  involve  the  NTD  region  of  cMyc  while  disruption  of 
a  key  Tryptophan  (W135)  residue  within  MBII  was  shown  to  abolish  interaction  of  N-Myc  with  TRRAP. 
Taken  together,  work  to  date  clearly  shows  that  TRRAP  holds  a  critical  role  in  the  regulation  and  function 
of  cMyc  and  identifying  the  region  of  cMyc  interaction  of  this  large  protein  will  mark  a  key  advance.  [7] 

In  this  study,  we  utilized  a  novel  RTA  technology  and  identified  a  large  region  of  TRRAP  comprising 
residues  1665-2030  as  important  for  cMyc  interaction.  Using  CD  spectroscopy  a  folded  globular  domain  of 
TRRAP i69o- 1877  has  been  identified  and  shown  to  bind  cMyci2o-i6o  containing  the  conserved  MBII.  We  found 
the  minimal  region  of  TRRAP  comprising  residues  1690-1850  exhibited  binding  to  cMyci2o-i6o  in  a  manner 
that  suggests  a  folding-on-binding  mode  of  interaction  between  these  proteins.  This  interaction  is  dependent 
on  W135  in  cMyc,  a  residue  that  has  been  shown  to  be  required  for  Myc-induced  transformation 
activity.  [10]  Moreover,  our  immunofluorescence  and  immunoprecipitation  results  provide  strong  evidence 
of  an  interaction  between  cMyc  and  TRRAP  in  the  nuclear,  chromatin-enriched  compartment  of  mammalian 
cells. 

Results  and  Discussion 

Repressed  Transactivator  Assay  defines  a  cMyc  interacting  region  of  TRRAP. 

To  facilitate  the  identification  of  the  region  within  TRRAP  that  interacts  with  cMyc,  we  used  an 
innovative  yeast  two-hybrid  system  for  bait  proteins  that  have  intrinsic  transactivation  activity.  We  have 
previously  used  this  system,  the  Repressed  Transactivator  (RTA)  assay,  to  investigate  Myc  NTD 
interactors.  [1 1,12]  In  the  RTA  (Figure  1(a)),  activation  of  ura3  and  lacZ  reporter  genes  by  the  Gal4-cMyc 
NTD  fusion  protein  results  in  the  death  of  the  yeast  when  plated  on  5-flouroorotic  acid  (FOA)  media  and 
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elevated  expression  of  beta-galactosidase  .  The  prey  proteins  in  the  RTA  are  fused  to  the  repression  domain 
of  the  yeast  TUP1  protein  such  that  a  specific  bait: prey  interaction  is  detected  as  repression  of  reporter  gene 
expression  resulting  in  the  growth  of  yeast  on  FOA  and  inhibition  of  beta-galactosidase  activity. 

Using  the  RTA,  several  large  regions  of  TRRAP  were  assayed  for  their  ability  to  interact  with  the  cMyc 
NTD  bait.  The  prey  vector  containing  the  positive  control  TBP,  the  TRRAP  fragments  I-III  (Figure  1(b)),  or 
empty  vector  (pBDH)  were  co-transformed  with  the  Gal4-cMyc  NTD  (Figure  1(c))  into  yeast  and  plated 
onto  media  with  or  without  FOA.  All  yeast  were  able  to  grow  in  the  absence  of  FOA  (Figure  1(c),  upper 
panel),  but  in  the  presence  of  FOA  (Figure  1(c),  lower  panel),  only  TRRAP  fragment  II  (residues  1665- 
2030)  and  TBP  were  able  to  confer  growth-rescue  of  yeast  expressing  the  Gal4-cMyc  NTD.  Suppression  of 
lacZ  expression  to  levels  similar  to  those  of  TBP  was  observed  when  TRRAP  fragment  II  was  co¬ 
transfected  into  yeast  with  the  cMyc-NTD  (Figure  1(d)).  Thus,  using  the  RTA  we  were  able  to  identify  a 
region  of  TRRAP  (1665-2030)  that  binds  the  cMyc  NTD  when  expressed  in  yeast  in  vivo. 

We  further  evaluated  the  region  of  cMyc  that  was  essential  for  TRRAP 1 665-2030  interaction  using  the 
RTA.  MBII  is  necessary  for  Myc  transformation  activity  and  Myc  interaction  with  TRRAP.  [7]  Interestingly 
a  substitution  mutation,  W135E  within  MBII,  has  been  shown  to  be  critical  for  Myc  transformation.  Using 
this  specific  point  mutant  we  evaluated  TRRAP  1 665-2030  interaction  with  the  wildtype  and  W135E  cMyc 
NTD.  Growth  rescue  in  the  presence  of  FOA  was  achieved  with  expression  of  TRRAP  1 665-2030  with  wildtype 
cMyc  NTD,  but  significantly  diminished  in  the  presence  of  Gal4-cMycNTD  W135E  (Figure  1(e)).  These 
data  suggest  that  residue  W135  plays  a  key  role  in  cMyc:TRRAP  interaction. 

Minimal  cMyc  binding  region 

To  further  define  the  minimal  interaction  domains  of  the  cMyc:TRRAP  complex,  twelve  domain-sized 
fragments,  surrounding  the  cMyc  binding  region  of  the  TRRAP  1 690-2026  were  cloned  and  tested  for 
expression,  solubility  and  interaction  with  cMyc  (Figure  2(a)).  All  the  fragments  were  insoluble  when 
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expressed  in  E.coli.  To  obtain  soluble  proteins  for  interaction  studies,  a  refolding  protocol  (see  Materials 
and  Methods)  was  developed  for  several  TRRAP  fragments. 

TRRAP 1690-2026  was  expressed  at  low  levels  and  was  proteolytically  degraded  into  a  smaller  fragment 
with  a  molecular  weight  corresponding  to  that  of  TRRAP1690-1899  (confirmed  by  Mass  Spectrometry), 
suggesting  the  presence  of  a  protease  resistant  domain.  Because  of  the  low  expression  level  of  TRRAPi69o- 
1899,  we  designed  and  cloned  a  series  of  smaller  TRRAPi690-i899  fragments  (Figure  2(a)).  TRRAPi690-i85o  and 
TRRAP 1690-1877  expressed  well,  but  extensive  precipitation  precluded  refolding  of  these  individual 
polypeptides  (5%-10%  refolding  yield).  However,  when  both  TRRAP)69o-i85o  and  TRRAP i69o-i 877  were 
solubilized  with  denaturant  and  refolded  in  the  presence  of  cMyci2o-i6o,  these  proteins  remained  soluble 
suggesting  that  the  folding  of  these  two  fragments  was  facilitated  by  cMyci2o-i6o  (40-50%  refolding  yield). 
TRRAP  1690- 1877  has  better  solubililty  properties  and  was  advanced  for  further  cMyc:TRRAP  in  vitro 
interactions  studies. 

While  this  work  was  in  progress  Park  et  al.  described  the  results  of  their  mapping  studies  to  identify  the 
region  of  TRRAP  that  interacts  with  cMyc  and  suggested  that  the  minimal  region  of  cMyc  interaction  may 
reside  in  TRRAP 1 899-2026.  [12]  To  directly  evaluate  whether  this  region  did  indeed  bind  to  cMyc,  we  cloned 
and  expressed  this  fragment  in  a  similar  manner  to  the  other  fragments  described  above.  TRRAP  1 899-2026  was 
the  only  fragment  that  could  be  expressed  in  soluble  form  and  therefore  it  was  further  evaluated  for  cMyc 
NTD  interaction. 

Biophysical  analysis  of  domains 

We  used  CD  and  NMR  spectroscopy  to  further  characterize  two  cMyc  NTD  (Figure  2(b))  and  TRRAP 
polypeptides.  The  CD  spectrum  of  cMyci-i53  shows  little  negative  elliplicity  at  approximately  222  nm  and  a 
minimum  shifted  towards  200  nm  which  indicates  a  largely  disordered  structure  with  low  helical  content 
(Figure  3(a)).  A  comparison  of  the  CD  spectrum  of  cMyci-153  with  that  previously  published  for  Myci.143 
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indicates  similar  secondary  structural  properties.  Recent  analysis  of  Myci_i67  suggest  that  residues  140-167 
contribute  to  the  folding  of  the  entire  cMyc  TAD  region. [5]  This  may  explain  why  our  cMyci-153  (lacking 
residues  154-167)  had  a  CD  spectrum  reflecting  significantly  less  secondary  structure  than  that  for  Myci_i67* 
The  CD  spectra  of  cMyci2o-i6o  on  the  other  hand  displays  negative  minima  near  208  nm  and  222  nm  (Figure 
3(a)),  consistent  with  some  a  helical  content  and  is  similar  to  CD  spectra  of  Myci_i67  and  Myc92-i67-[5]  This 
region  contains  MBII  residues  128-143  which  are  conserved  in  all  Myc  family  proteins  and  known  to  be 
essential  for  the  cMyc:TRRAP  interaction. 

The  ['H-15N]HSQC  spectrum  of  cMyci2o-i6o  (Figure  3(c))  a  soluble,  partially  folded  protein  with  the 
approximate  number  of  expected  cross  peaks.  The  non-uniform  peak  intensity  and  lack  of  chemical  shift 
dispersion  are  both  consistent  with  a  dynamically  disordered  and/or  partially  helical  polypeptide  as 
suggested  by  the  CD  spectra.  The  HSQC  of  cMyci-153  (not  shown)  reflects  a  primarily  disordered  protein 
with  extensive  spectral  overlap.  Therefore,  cMyci2o-i6o,  was  advanced  for  further  cMyc:TRRAP  interaction 
studies. 

The  U1-15N-TRRAP  1899.2026  HSQC  spectrum  reflects  a  soluble,  non-aggregated,  but  disordered  protein 
with  very  little  chemical  shift  dispersion,  and  uniformly  narrow  peak  width  (Fig.  3(d)).  Addition  of  cMyci_ 
153  had  no  effect  on  the  spectral  features  of  U1-15N-TRRAPi 899-2026  (data  not  shown).  We  used  GST 
pulldown  assay  to  (see  below)  to  evaluate  the  interaction  of  TRRAPi 899-2026  with  cMyci2o-i6o-  No  interaction 
was  observed  between  these  proteins.  Therefore,  we  conclude  that  there  is  very  little  or  no  interaction 
between  TRRAPj 899-2026  and  both  cMyc  fragments  under  these  conditions,  as  a  complex  could  not  be 
identified  by  either  NMR,  or  by  GST  pull  down  assays  at  millimolar  protein  concentrations. 

Using  CD  spectroscopy  we  have  identified  a  region  of  TRRAP  that  appears  to  have  a  folded,  globular 
domain  (TRRAP1690-1877)  and  is  likely  to  form  a  complex  with  the  MBII.  While  the  spectrum  of  cMyci2o-i60 
alone  was  characteristic  of  a  mix  of  helical  and  random  coil  conformation,  the  CD  spectrum  of  TRRAP1690- 


7 


SUBMITTED 


1877  fragment  alone  had  a  strong  minimum  near  217  nm,  characteristic  of  a  (3-sheet  structure.  The  spectrum 
of  a  stoichiometric  mixture  of  the  two  proteins  is  clearly  distinct  from  those  of  TRRAP1690-1877  and  cMyc  120- 
i6o  alone,  and  from  the  sum  of  the  two  individual  CD  spectra  with  an  obvious  minimum  at  222  nm  (Fig. 
3(b))  indicative  of  the  conversion  of  a  random  coil  to  an  a-helical  conformation,  and  consistent  with  an 
interaction  between  these  domains. 

In  addition,  we  focused  on  a  particular  residue  within  MBII,  W135,  that  has  been  shown  to  be  required 
for  Myc-induced  transformation.  Interestingly  when  the  cMyci2o-i6o  mutant  ,  W135E,  was  mixed  in 
stoichiometric  amounts  with  TRRAP1690-1877  the  CD  spectrum  of  the  mixture  showed  no  difference  from  the 
sum  of  the  individual  spectra  suggesting  no  interaction  to  place  (Figure  3b). 

In  vitro  binding  confirms  cMyc:  TRRAP  minimal  domains 

A  more  direct  assay  of  protein:protein  interactions  using  the  in  vitro  GST  pulldown  technique  was 
performed  to  evaluate  the  interaction  of  between  cMyc  and  TRRAP.  Immobilized  GST-cMyci2o-i6o  was 
incubated  with  purified  His-tagged  TRRAP|69o-i877,  TRRAPi69o-i850  or  TRRAP  1899-2021-  The  GST-cMyc^o- 
i6oW135E  was  incubated  with  His  tagged  TRRAPi69o-i877-  The  results  clearly  showed  an  interaction  of 
cMyci2o-i6o  with  both  TRRAP i69o-i 877  and  TRRAPi 590-1 850,  but  not  with  TRRAP  1399-2021  (Figure  4).  No 
interaction  was  observed  between  cMyci2o-i6oW135E  and  TRRAP i69o-i 877  (Figure  4(d)).  Together,  these  data 
indicate  that  cMyci2o-i6o  can  bind  to  TRRAPi69o-i85o  and  TRRAP)69o-i877  and  a  nonconservative  mutation 
within  MBII,  W135E,  was  able  to  disrupt  the  binding  between  these  proteins.  This  is  consistent  with  our 
RTA  (Fig.  1(e))  and  the  CD  spectra  which  could  be  explained  by  either  a  destabilizing  affect  on  the 
structural  integrity  of  MBII,  or  direct  participation  of  W135  in  the  cMyc:TRRAP  interaction.  Thus, 
consistent  with  previous  reports,  we  conclude  that  MBII  sequence  is  required  for  efficient  interaction  of  the 
cMyc  NTD  and  TRRAP. 

TRRAP1690 -1850  interacts  with  cMyc  in  vivo. 
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We  further  investigated  whether  the  interaction  of  TRRAP1690-1850  and  cMyc  was  detectable  in 
mammalian  cells  in  vivo.  To  this  end,  293TV  cells  were  transiently  transfected  with  pCMVIO  expression 
plasmid  carrying  a  fusion  protein  containing  a  FLAG-tagged  nuclear  localization  signal  (NLS)  linked  in- 
frame  with  TRRAPi69o-i85o.  To  investigate  the  sub-cellular  localization  of  TRRAPi690-i850,  we  conducted 
immunofluorescence  studies  and  biochemical  analyses.  Cells  were  fixed  and  immuno-stained  with 
polyclonal  anti-Myc  and  monoclonal  anti-FLAG  antibodies  to  detect  Myc  and  TRRAP1690-1850  respectively. 
As  shown  in  Figure  5(a-d),  TRRAP1690-1850  and  cMyc  primarily  co-localize  in  the  nucleus  of  293TV  cells,  in 
a  diffuse  dot  pattern  (white  arrows  in  Figure  5(c)).  In  addition,  a  less  prevalent  staining  of  TRRAP1690-1850  is 
shown  in  the  cytoplasmic  compartment,  most  likely  due  to  over-expression  of  the  protein.  For  biochemical 
analysis,  cell  lysates  were  harvested  and  sub-cellular  fractions  prepared.  Equivalent  volumes  of  cytoplasmic, 
nuclear  and  chromatin-enriched  fractions  were  separated  on  a  12%  SDS-PAGE  gel  (Figure  5(e)).  In  addition 
to  anti-FLAG,  immunoblots  were  probed  with  anti-tubulin  and  anti-acetylated  histone  H3  to  monitor 
efficiency  of  cellular  fractionation.  Interestingly,  Flag-NLS-TRRAPi69o-i85o  was  highly  enriched  in  the 
chromatin  fractions,  yet  also  evident  in  the  cytoplasmic  fraction,  in  accordance  with  the 
immunofluorescence  results.  To  determine  if  TRRAP1690-1850  interacts  with  full  length  cMyc  in  vivo  after 
transient  co-expression  in  293TV  cells  we  conduced  co-immunoprecipitation  experiments.  Cell  lysates  were 
immunoprecipitated  with  anti-Myc  antibodies  (N-262)  followed  by  immunoblotting  with  anti-FLAG 
antibodies  to  identify  FLAG-TRRAP1690-1850  protein  (Figure  5(f)).  As  shown  in  Figure  5(f)  lane  4,  FLAG- 
TRRAP 1690-1850  was  specifically  co-immunoprecipitated  by  cMyc  and  not  by  rabbit  IgG  (lane  3). 

Our  immunofluorescence  and  immunoprecipitation  results  provide  strong  evidence  of  an  interaction 
between  cMyc  and  TRRAP  in  the  nuclear,  chromatin-enriched  compartment  of  mammalian  cells.  Indeed, 
ectopically  expressed  TRRAP  colocalizes  with  cMyc  in  293TV  cells.  As  well  TRRAPi69o-i85o  resides  in  the 
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chromatin-enriched  fraction,  along  with  cMyc,  and  is  co-immunoprecipitated  with  ectopic  cMyc  from 
mammalian  cells. 

Summary 

In  this  study,  we  provide  evidence  of  a  physical  interaction  between  cMyc  and  TRRAPi690-ib77  from 
analyses  conducted  both  in  vitro  (CD,  GST-pulldown)  and  in  vivo  (co-IF  and  co-IP  in  mammalian  cells). 
The  present  results  thus  suggest  that  cMycno-no  contributes  significantly  both  to  folding  and  binding 
properties,  and  may  be  required  for  full  biophysical  and  biological  functionality  of  the  cMyc  TAD  region. 
Importantly,  our  finding  of  increased  secondary  structure  in  the  complex  compared  to  that  of  the  individual 
polypeptides  and  the  improved  behavior  of  TRRAPi69o-i877  in  the  presence  of  cMyci2o-i6o  suggest  an 
induced-fit  model  of  cMyc  NTD  protein:protein  interaction. 

Our  work  extends  a  previous  report  using  GST  pull  down  assays  to  show  that  cMyc  and  TRRAPi59i_2026 
can  interact,  but  are  inconsistent  with  previous  co-immunoprecipitation  data  showing  cMyc  and  TRRAP1899- 
2401  can  interact.  It  may  be  that  another  cMyc-binding  region  resides  in  this  latter  portion  of  TRRAP, 
however  our  results  are  not  supportive  of  this  possibility.  We  show  that  cMyc  does  not  interact  with 
TRRAP  1899-2026  in  vitro,  and  a  region  of  TRRAP  corresponding  to  residues  2000-2365  (Figure  1)  did  not 
interact  with  the  cMyc  NTD  in  the  yeast  RTA  assay  in  vivo.  Our  preferred  model  is  that  the  weak 
interaction  observed  between  cMyc  and  TRRAP 1 899-2401  in  mammalian  cells  may  be  a  consequence  of 
indirect  binding  of  cMyc  to  another  transcriptional  activator  that  binds  TRRAP  in  this  region.  Indeed,  the 
large  TRRAP  protein  interacts  with  several  transcriptional  activators  through  distinct  regions  of  interaction 
and  has  been  suggested  to  bridge  functional  binding  interactions  between  different  activators. [13]  It  will  be 
interesting  to  further  dissect  which  activator(s)  binds  to  TRRAP  1 899-2401,  as  previous  results  have  shown 
ectopic  expression  of  this  region  can  prolong  the  doubling  time  of  IRM-5  neuroblastoma  cells.  For  example, 
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p53  has  been  shown  to  bind  TRRAP 1992-2370 [14]  and  may  be  one  of  many  candidate  activators  that  bind 
TRRAP 1899-2401  to  repress  cellular  proliferation. 

The  specific  interaction  of  cMyci2o-i60  and  TRRAP i69o-i 877  provides  a  novel  protein:protein  interaction 
that  is  an  attractive  target  for  therapeutic  intervention  in  cMyc  activated  cancers.  These  results  establish  the 
basis  for  subsequent  biochemical  and  structural  analyses  of  the  cMyc:TRRAP  complex  leading  to  a  more 
comprehensive  view  of  the  cMyc  regulatory  network  and  will  provide  new  insights  into  understanding  how 
cMyc  function  is  regulated  by  interaction  with  the  transcriptional  coactivator  TRRAP. 

Materials  and  Methods 

Protein  expression  and  purification 

All  proteins  were  expressed  in  E.  coli  BL21-CodonPlus  (DE3)-RIL  cells  (Stratagene)  and  purified  using 
metal  affinity  chromatography  under  native  conditions  for  the  TRRAP  1 899-2026  and  in  the  presence  of 
guanidinium  chloride  (GnCl)  for  cMyc  and  other  TRRAP  fragments.  The  latter  underwent  subsequent 
refolding  by  rapid  dilution  and  dialysis  against  25mM  TRIS  pH  8,  500mM  NaCl,  2  mM  MgCl2,  10% 
Glycerol,  0.55M  GnCl  followed  by  dialysis  against  the  same  buffer  without  glycerol  and  GnCl.  cMyci20-i6o 
and  cMyci-153  were  refolded  by  2-step  dialysis  in  25mM  NaH2P04,  150mM  NaCl,  pH  6.5,  3M  GnCl, 
followed  by  dialysis  against  the  same  buffer  without  GnCl.  All  buffers  contained  2mM  benzamidine, 
0.5mM  PMSF  and  0.5mM  TCEP. 

GST  pull  down  assay 

Purified  TRRAP 1 899-2026,  TRRAPi69o-i85o  and  TRRAP i69o-i 877  HIS-tagged  proteins  were  incubated  with  GST- 
fusion  cMyci20-i6o  and  cMyci20-i6o(W135E)  bound  to  the  GST  beads  at  4°C  for  lhr.  After  extensive  washing 
with  assay  buffer,  bound  proteins  were  eluted  with  20mM  reduced  glutathione  and  detected  by  SDS-PAGE 
gel.  Control  assays  with  GST  protein  were  also  completed  to  validate  results. 
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CD  and  NMR  spectroscopy 

CD  scans  were  performed  on  AVIV  62DS  spectrometer.  Protein  concentrations  used  for  CD  were  0.1- 
0.15mg/ml.  An  equimolar  solution  of  cMyci2o-i6o  and  TRRAPi690-i877  was  used  for  the  mixed  spectra.  The 
mean  residue  elliplicity  (0)  is  expressed  in  units  of  10'  deg.  cnrdmor  . 

All  NMR  spectra  were  recorded  at  25°C  on  Varian  INOVA  500  MHz  spectrometer  equipped  with  room 
temperature  triple-resonance  probe.  The  final  NMR  samples  contained  90%H20/10%D20  with  a  protein 
concentration  0. 2-0.4  mM. 

Immunoprecipitation  and  Immunoblot 

Co-immunoprecipitation  was  preformed  using  lmg  of  whole  cell  lysates  prepared  using  lysis  buffer  A 
(15mM  Tris-HCL  pH  7.5,  500mM  Nacl,  0.35%  NP-40,  5mM  EGTA,  5mM  EDTA)  and  incubated  over¬ 
night  with  lOpl  of  anti-Myc  (rabbit  sera)  or  IgG  (Santa  Cruz)  and  G-sepharose  beads  (Santa  Cruz)  ..  After  5 
washes  in  Buffer  B  (25mM  Tris-HCL  pH  7.5,  250mM  Nacl,  0.5%  NP-40,  ImM  EDTA,  10%  glycerol,  ImM 
DTT)  immunoprecipitates  were  resolved  by  12%  SDS-PAGE  and  subjected  to  immunoblot  analysis  using 
anti-Myc  (9E10)  and  anti-FLAG  monoclonal  antibodies  (M2;  Sigma). 

Chromatin  association  assay 

293TV  cell  transiently  expressing  pCMVIO-FLAG-  TRRAPi690-i85o  were  processed  as  described  by 
Wysocka  et  al.  [15] 

Immunofluorescence 

Transiently  transfected  293TV  cells  were  fixed  in  3%  paraformaldehyde,  permeabilized  in  0.1%  Tween 
20/PBS,  and  incubated  with  anti-Myc  (1:500,  N-262)  or  anti-FLAG  (1:50,  M2)  antibodies.  Cells  were 
washed,  incubated  with  Alexa-488  anti-mouse  or  Alexa-568  anti-rabbit  (Molecular  Probes,  Inc.)  secondary 
antibodies  and  processed  for  confocal  microscopy. 
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Figure  1.  TRRAPi 690-2030  interacts  with  Gal4-Myc  NTD.  (a)  Interaction  between  the  bait  and  prey  in  the 
RTA  results  in  repression  of  reporter  genes  ura3  and  lacZ  as  detected  by  yeast  cell  growth  on  FOA  and 
inhibition  of  beta-galactosidase  activity,  respectively,  (b)  Schematic  representation  of  TRRAP  showing 
relative  position  of  the  fragments  analyzed  in  the  RTA.  (c)  Gal4-Myc  NTD  bait  was  co-transformed 
into  yeast  cells  with  TBP  (positive  control)  or  TRRAP  fragments  or  the  repression  domain  alone 
(pBDH).  Resulting  colonies  were  plated  onto  +/-FOA  media,  (d)  Yeast  co-transfected  as  in  (c)  were 
evaluated  for  lacZ  expression,  (e)  Yeast  co-transfected  with  Gal4-Myc  NTD  wildtype  (wt)  or  Gal4-Myc- 
NTDW135E  with  TRRAP  fragment  II  were  evaluated  for  growth  in  the  presence  of  FOA. 

Figure  2.  TRRAPi 690-2026  central  region  expression  constructs  with  indicated  expression  levels:  (-)  no 
expression  ,  (+)  low  level  (lmg/lL  LB),  and  (+++)  good  level  of  expression  (8-12mg/lL  LB)  (b) 
Schematic  representation  of  human  cMyc  highlighting  the  relative  location  of  the  TAD  as  well  as  the 
conserved  domains  within  the  NTD.  The  NTD  expression  constructs  are  also  indicated. 

Figure  3.  Secondary  structure  characterization  of  cMyc  and  TRRAP  proteins:  (a)  CD  spectra  of  cMyci. 
153  and  cMyci2o-i6o-  (b)  Interaction  of  cMyci2o-i6o  with  TRRAP  causes  a  conformational  change.  The  CD 
spectrum  was  measured  for  TRRAP i69o-i 877  and  cMyci2o-i6o  alone  and  for  a  mixture  of  two  proteins, 
TRRAP  1 690-  1877/cMyc  120- 160,  as  well  as  for  TRRAPi69o-i877/cMyci2o-i6oW135E.  [^N-'^HSQC  spectrum 
of:  (c)  cMycno-leo  and  (d)  TRRAPi 899-2026. 

Figure  4.  TRRAP i69o-i 877  and  TRRAPi69o-i85o  mediate  cMyci2o-i6o  binding.  Immobilized  GST-  cMyc^o- 
160  co-incubated  with  His-tagged:  (a)  TRRAPi 899-2026;  (b)  TRRAP i69o-i 877;  and  (c)  TRRAP 1690-i 877 
polypeptides  (L).  (d)  Immobilized  GST-Myci2o-i6oW135E  mutant  co-incubated  with  TRRAPi69o-i877  (L). 
After  washing  with  PBSxl  buffer  and  elution  with  reduced  glutatione  buffer  (E),  interacting  proteins 
were  subjected  to  SDS  PAGE  analysis.  (*)  indicates  degradation  products. 
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Figure  5.  TRRAP1690-1850  interacts  with  c-Myc  in  the  nucleus  of  mammalian  cells.  293TV  cells  (a-d) 
were  transiently  transfected  with  FLAG-NLS-TRRAP1690-1850  and  c-Myc,  Cells  were  stained  with  anti- 
FLAG  antibody  and  anti-Myc  antibodies  and  processed  for  confocal  imaging,  (a)  Green 
immunofluorescence  staining  shows  nuclear  and  some  cytoplasmic  localization  of  TRRAPi69o-i85o-  (b) 
Nuclear  localization  of  c-Myc  was  shown  by  red  fluorescence,  (c).  Merged  fluorescence  images  indicate 
nuclear  co-localization  of  TRRAPi69o-i85o  and  c-Myc  (white  arrows),  (d)  Dapi  staining  (e)  293TV  cells 
transfected  with  FLAG-NLS-TRRAP1690-1850  plasmid  were  harvested  and  sub-cellular  fractions 
prepared.  Equivalent  volume  subfractions  were  separated  by  SDS-PAGE.  The  blot  was  probed  with 
anti-FLAG  then  re-probed  with  fractionation  controls,  anti-Histone  H3  and  anti-alpha  tubulin  antibodies, 
(f)  293TV  cells  were  transfected  with  FLAG-NLS-TRRAPi69o-i85o  and  c-Myc.  Total  cell  extract  was 
subjected  to  immunoprecipitation  with  anti-c-Myc  antibodies  or  rabbit  IgG.  Co-immunoprecipitated 
TRRAP 1690-1850  was  visualized  after  SDS-PAGE  and  immunoblotting  with  anti-FLAG  antibodies 
(bottom).  The  upper  part  of  the  blot  was  probed  with  anti-Myc  antibodies  (top);  Input  control  (10%)  is 
shown  in  the  second  lane  while  a  cell  extract  of  mock  transfected  cells  is  shown  in  the  first  lane. 
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ABSTRACT 

High-throughput,  microarray-based  chromatin  immunoprecipitation  (ChIP-chip) 
technology,  allows  in  vivo  elucidation  of  transcriptional  networks.  However,  this 
complex  assay  is  not  readily  accessible,  in  part  because  its  many  parameters  have  not 
been  systematically  evaluated  and  optimized.  We  fill  this  gap  by  addressing 
experimental-design  parameters  including  antibody  purity,  dye-bias,  array-batch,  inter¬ 
day  hybridization,  amplification  method,  and  choice  of  hybridization  control.  The 
combined  performance  of  these  optimized  parameters  shows  high  validation  rates  in  a 
ChIP-chip  analysis  of  Myc  genomic  binding  in  HL60  cells.  Increased  sensitivity  and 
decreased  noise  in  ChIP-chip  assays  will  enable  wider  use  of  this  methodology  to 
accurately  and  affordably  elucidate  transcriptional  networks. 
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BACKGROUND 

The  combination  of  Chromatin  immunoprecipitation  (ChIP)  with  high-throughput  DNA 
microarray  technology  (chip)  provides  a  powerful  method  (ChIP-chip)  for  mapping 
protein-DNA  interactions  in  vivo.  The  ChIP-chip  procedure  was  first  used  to  study  yeast 
transcription  factors  [1-4],  and  has  recently  been  exploited  for  similar  studies  in 
mammalian  cells  [5-14].  In  the  ChIP-chip  procedure  genomic  DNA  is  precipitated  in 
ChIP  reactions  using  specific  and  control  antibodies,  then  labeled  and  hybridized  to 
genomic  microarrays.  Several  different  array  platforms  have  been  used,  including 
proximal  promoter  region  arrays  [11],  CpG  island  arrays  [12,  15]  and  whole-genome 
tiling  arrays  [16,  17].  ChIP-chip  is  an  extremely  versatile  technique,  and  is  capable  of 
identifying  target  genes  whose  regulatory  regions  are  bound  by  transcription  factors 
through  protein-DNA  and/or  protein-protein  interactions.  Moreover,  ChIP-chip  can  be 
applied  to  any  cell  or  tissue,  making  it  possible  to  decode  gene  regulatory  networks  in 
vivo.  Certainly,  the  ability  to  profile  bona  fide  target  genes  has  been  revolutionized  by 
ChIP-chip.  Unlike  mRNA  expression  analysis,  the  genetic  program  specifically  directed 
by  the  transcription  factor  can  be  distinguished  from  subsequent  downstream  regulatory 
events.  [18-22], 

Despite  ChIP-chip  methodology  having  become  the  gold  standard  for  discovering 
the  genomic  binding  sites  of  transcriptional  regulators,  there  is  wide  variability  in 
experimental  design,  which  complicates  and  delays  widespread  application.  ChIP-chip 
experiments  require  careful  optimization  of  a  number  of  parameters.  For  exemple,  the 
number  of  cells  or  amount  of  tissue  used  as  starting  material  varies  widely  from  one 
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study  to  another  [23,  24].  Next,  the  protein-protein  and/or  protein-DNA  cross-linking, 
chromatin  sonication,  as  well  as  antibody  sensitivity  and  purity  characteristics  can  also 
vary  significantly.  In  most  studies,  the  enriched  DNA  recovered  after  the  ChIP  procedure 
is  amplified,  using  a  variety  of  methods  such  as  ligation-mediated  [24,  25],  random 
priming  [26]  and  T7  priming  [27]  PCR.  Finally,  when  the  amplified  and  labeled  DNA  is 
hybridized  to  a  microarray,  a  control  sample  must  be  selected,  and  the  effects  of  array 
batch  and  dye-swap  status  considered. 

Extensive  evaluation  of  experimental  design  parameters  for  mRNA  expression 
arrays  have  been  performed  by  a  number  of  groups  over  the  past  decade.  As  a  result,  the 
key  factors  are  well  understood  and  the  assay  has  been  optimized  [28].  It  is  possible,  for 
example,  to  estimate  the  number  of  biological  replicates  required  to  sufficiently  power  a 
specific  hypothesis-testing  question  [29].  Despite  clear  evidence  that  parameter 
optimization  can  profoundly  impact  array  analysis,  ChIP-chip  design  parameters  have  not 
yet  been  thoroughly  and  systematically  investigated. 

Here,  we  fill  that  gap  by  providing  a  comprehensive  evaluation  of  experimental 
design  parameters  for  ChIP-chip  studies.  Through  a  series  of  validation  studies,  we 
address  both  the  parameters  previously  investigated  for  mRNA  expression  studies  as  well 
as  those  specific  to  ChIP-chip  experiments.  We  exploit  a  well-characterized  system:  the 
genomic  binding  of  the  Myc  oncoprotein  in  HL60  cells,  a  human  myelogenous  leukemia 
cell  line,  combined  with  CpG  island  arrays  [12,  30].  Many  parameters  for  successful 
ChIP-chip  studies  were  analyzed,  including  the  impact  of  antibody  purity,  array  batch 
variability,  dye-bias,  inter-day  hybridization- variability,  amplification  procedure,  and 
hybridization  control.  In  addition,  we  evaluated  the  combined  effect  of  the  optimized 
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parameters  by  conducting  a  Myc  ChIP-chip  study,  which  showed  a  high  rate  of  validation 
by  real  time  PCR.  The  raw  data  from  this  study,  encompassing  nearly  100  arrays,  will  be 
deposited  in  the  GEO  repository  at  NCBI.  Our  careful  description  of  ChIP-chip 
experimental  design  is  a  key  step  towards  enabling  widespread  exploitation  of  this 
important  technology  for  the  rapid  elucidation  of  transcriptional  regulatory  networks. 
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RESULTS 

Our  analysis  of  ChIP-chip  design  parameters  was  conducted  in  the  context  of  the 
genomic-binding  of  the  well  characterized  Myc  oncogene  in  an  acute  myelogenous 
leukemia  cell  line  [12],  HL60  [31].  All  experiments  utilized  12k  CpG  island  microarrays 
[30], 

Antibody  selection,  array  batch,  reciprocal  labeling  and  inter-day 
processing 

The  first  series  of  experiments  focused  on  assessing  four  key  parameters  of  the 
ChIP-chip  assay.  Antibody  selection  was  considered  first  because  the 
immunoprecipitation  step  provides  the  starting  material  for  the  array  analyses,  and  as 
such  is  a  major  determinant  of  the  success  of  ChIP-chip  studies.  Second,  we  addressed 
the  effect  of  array  batch.  This  can  be  a  major  confounding  variable  because  DNA  spotted 
to  a  microarray  slide  may  degrade  or  otherwise  lose  sensitivity,  and  even  arrays  printed  in 
different  print-runs  from  a  single  spotter  can  show  variability.  Third,  we  evaluated  the 
bias  introduced  during  the  process  of  sample  labeling.  This  bias  refers  to  intensity 
differences  between  identical  samples  labeled  with  different  dyes  and  can  arise  in  several 
ways,  including  differential  incorporation  efficiencies  for  the  two  dyes,  fluorophore 
degradation,  as  well  as  photo-bleaching  rates  or  scanner-induced  bias.  Fourth,  we 
determined  whether  day-to-day  variations  in  atmospheric  ozone,  technical  handling, 
scanner  calibration,  or  other,  as  yet  unidentified,  confounders  affect  the  hybridization 
step. 

To  evaluate  the  effect  of  these  ChIP-chip  parameters,  we  conducted  a  saturated, 
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fully  blocked  experiment  (Fig.  1).  First,  six  independent  biological  replicates  of  cross- 
linked  HL60  cells  were  subjected  to  ChIP  with  either  commercial  or  home-made 
polyclonal  antibodies  raised  against  the  N-terminal  262  residues  of  the  Myc  oncoprotein. 
For  every  replicate  ChIP  reaction  we  probed  two  batches  of  arrays  separated  in  print-age 
by  about  6  months  to  evaluate  array  batch  effect.  Further,  each  sample  was  reciprocally 
labeled  to  allow  us  to  quantify  the  effects  of  dye  bias.  Thus,  each  biological  replicate  was 
hybridized  to  four  separate  arrays,  leading  to  24  arrays  for  each  antibody  and  48  arrays  in 
total.  Finally,  these  48  arrays  were  fully  blocked  across  seven  days,  allowing  us  to 
robustly  estimate  the  effect  of  date-of-hybridization  (Fig.  1). 

To  quantify  the  importance  of  antibody  selection  we  compared  two  polyclonal 
antibodies  raised  against  the  same  N-terminal  region  of  the  Myc  protein.  One  antibody 
was  purchased  from  a  commercial  vendor  while  the  other  was  locally  produced  (Fig.  1). 
Across  the  48  arrays  of  our  study,  we  observed  a  significant  antibody  effect.  When  the  p- 
values  of  the  top-ranked  spots  on  the  array  are  compared  for  the  two  antibodies,  a  linear 
trend  is  observed,  suggesting  specificity  is  comparable  (Fig.  2A).  However,  despite 
equivalent  replicate  numbers  and  full-blocking  of  other  parameters,  as  described  below, 
p-values  are  distinctly  lower  for  the  commercial  antibody,  suggesting  that  it  results  in 
better  signal  to  noise  characteristics.  To  increase  the  specific  activity  of  our  home-made 
antibody,  we  subjected  it  to  a  purification  procedure  to  remove  contaminating  serum 
proteins,  and  compared  purified  and  unpurified  home-made  antibody  in  a  similar  fashion 
(Fig.  2B).  A  linear  trend  was  again  observed,  with  the  purified  antibody  showing  lower 
p-values  than  the  unpurified.  Note  that  the  magnitude  of  p-values  in  Figures  2A  and  2B 
are  different  because  of  the  different  replicate  numbers  involved  (Fig.  2A,  n  =  24;  2B, 
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n=4  for  each  antibody).  These  data  suggest  that  antibody  purity  is  a  key  factor 
influencing  the  specificity  and  sensitivity  of  ChIP-chip  studies,  therefore  antibody 
purification  may  improve  the  performance  of  some  antibodies. 

To  evaluate  the  effect  of  array  batch,  dye  bias  and  inter-day  hybridization,  we  fit 
spot-wise  linear  models  to  the  results  of  each  parameter  separately  and  plotted  the 
Gaussian  densities  of  the  magnitudes  of  these  effects  (Fig.  3).  The  curves  in  the  top 
portion  of  the  figure  show  that  for  all  three  sources  of  variability  (dye,  batch  and  date)  the 
majority  of  spots  on  the  array  show  little  to  no  effect.  Most  spots  exhibit  a  fold  change  in 
the  range  of  -0.5  to  +0.5  log2  units,  corresponding  to  an  effect  of  at  most  1.4-fold. 

Overall,  no  more  than  10%  of  spots  on  the  array  show  large  variability  towards  any  of 
these  parameters  -  suggesting  that  both  the  ChIP  protocol  and  the  CpG  island  arrays  used 
are  robust  towards  perturbations  in  any  of  these  factors. 

Despite  this  basal  similarity,  though,  there  are  differences  across  the  three 
parameters.  The  curve  of  dye-bias  shows  significant  asymmetry,  corresponding  to  a 
consistent  Cy3  bias.  While  only  1 1  spots  show  Cy5/Cy3  ratios  of  1.4-fold  or  higher,  685 
spots  show  Cy3/Cy5  ratios  of  1.4-fold  or  greater.  This  marked  asymmetry  highlights  the 
importance  of  balancing  experiments  across  the  two  dyes  used  in  the  study. 

The  effect  of  array  batch  shows  a  similar,  but  smaller  asymmetry.  Older  arrays 
generally  show  lower  signal  intensities  than  newer  arrays,  with  408  spots  showing  an 
old/new  ratio  of  at  least  1.4-fold,  while  only  210  spots  show  a  new/old  ratio  of  similar 
magnitude.  This  variability  suggests  that  experiments  should  be  conducted  with  arrays  of 
a  single  batch,  or  experimental  samples  randomized  across  multiple  batches. 

Surprisingly,  the  effects  of  hybridization  date  affect  about  10%  of  the  spots  on  the 
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array,  despite  the  same,  experienced  hands  conducting  all  hybridizations  in  this  series  of 
experiments.  This  effect  is  symmetrical,  with  approximately  equal  numbers  of  spots 
showing  date-dependent  enrichment  and  depletion.  The  source  of  this  bias  is  unknown. 

It  is  possible  that  atmospheric  ozone  may  have  differed  from  day-to-day.  We  and  others 
have  experienced  Cy5  squelching  due  to  ozone  [32,  33].  However,  in  this  case  ozone  is 
not  likely  to  be  the  source  of  the  bias,  as  these  experiments  were  conducted  during  times 
of  consistent  low  atmospheric  ozone.  It  is  also  possible  that  variation  in  handling  at  the 
level  of  post-hybridization  washes  may  contribute  to  this  inter-day  variability.  Another 
possibility  is  that  scanner  calibration  varied  across  the  course  of  our  experiment,  but  the 
biases  described  here  are  not  temporally-dependent  (data  not  shown).  This  suggests  that 
if  scanner-calibration  is  an  issue,  then  it  is  a  random  drift  rather  than  a  progressive 
degradation. 

Amplification  method 

Sufficient  DNA  for  microarray  hybridization  cannot  usually  be  obtained  from  a 
single  ChIP  reaction.  The  amount  required  can  be  achieved  by  pooling  multiple  ChIP 
reactions  [12],  but  this  approach  is  expensive  and  labor  intensive.  Further,  when  dealing 
with  biologically  limited  samples,  such  as  primary  tumor  material,  repeated  ChIPs  may 
not  be  possible.  Indeed,  in  such  cases  repeated  ChIPs  may  introduce  intra-sample 
heterogeneity  as  an  additional  confounding  variable  [34].  To  overcome  this  rate-limiting 
step,  the  precipitated  DNA  in  a  single  ChIP  reaction  is  usually  amplified.  A  variety  of 
PCR-based  methods  exists,  including  ligation-mediated  PCR  (LM-PCR),  random  priming 
PCR  (RP-PCR)  and  T7  priming  PCR  methods.  Here  we  develop  an  assay  to  evaluate  the 
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bias  introduced  by  a  given  amplification  protocol.  We  test  the  two  most  common 
amplification  techniques  employed  in  ChIP-chip  experiments,  RP-PCR  and  LM-PCR 
(Fig.  4A)  and  for  the  LM-PCR  method  we  evaluate  protocols  with  either  low  (LM-PCR 
A)  or  high  (LM-PCR  B)  cycle  number.  Total  genomic  DNA  was  amplified  with  one  of 
the  three  protocols,  then  hybridized  against  an  equal  amount  of  unamplified  total 
genomic  DNA  (Fig.  4A).  As  shown  in  Figure  4B,  for  any  given  p-value  threshold,  LM- 
PCR  A  shows  a  smaller  number  of  differentially  bound  spots  than  either  RP-PCR  or  LM- 
PCR  B.  Thus,  we  show  that  it  is  possible  to  evaluate  a  variety  of  amplification  protocols, 
using  genomic  DNA  to  identify  which  method  minimizes  bias. 

Hybridization  Control 

A  variety  of  different  hybridization  controls  have  been  used  for  ChIP-chip  studies. 
For  example,  some  groups  have  employed  a  “no  antibody”  control  [12],  while  others 
have  used  total  genomic  DNA  [11,  35].  To  determine  if  there  are  significant  differences 
between  these  hybridization  controls,  we  designed  a  series  of  experiments  to  test  each 
approach  (Fig.  5A).  Specifically,  we  compared  direct  hybridization  of  ChIP  samples 
against  a  mock  treated  control  (Direct  No  Ab)  and  against  an  IgG  antibody  control 
(Direct  IgG).  We  also  considered  the  use  of  total  input  DNA  as  a  “denominator”  by 
hybridizing  ChIP  and  IgG  samples  separately  against  total-input  DNA  (Indirect  No  Ab), 
then  comparing  their  intensities  indirectly  during  statistical  analysis  by  using  a  linear 
model.  The  results  (Fig.  5B)  are  surprisingly  invariant  from  one  control  to  the  next,  and  at 
any  given  p-value  the  number  of  hits  identified  in  each  experiment  is  nearly  identical. 
Nevertheless,  the  two  direct  designs  show  more  sensitivity  than  the  indirect  approach. 
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This  strongly  suggests  that  direct  controls  are  superior  to  the  indirect  approach,  despite 
the  loss  of  the  “denominator  effect”.  Another  feature  that  adds  merit  to  the  direct  design 
method,  is  that  half  as  many  arrays  are  required  to  analyze  each  biological  replicate, 
compared  to  the  indirect  approach,  thereby  significantly  decreasing  the  cost  of  ChIP-chip 
experiments. 

c-Myc  ChIP-chip  parameter  choice 

We  conducted  a  Myc  ChIP-chip  experiment  using  the  optimized  parameters 
determined  above.  Thus,  we  used  a  purified  antibody  for  the  ChIP  reaction,  DNA  was 
amplified  by  LM-PCR,  we  hybridized  a  single  CpG  island  array  batch  from  a  recent  print 
run,  and  IgG  was  the  choice  as  a  hybridization  control.  Dye  swaps  were  randomized 
across  hybridization  dates,  and  the  entire  experiment  was  conducted  at  high  replication 
(n=13).  By  using  this  highly  optimized  design  we  expected  to  both  minimize  bias  and 
increase  specificity.  Following  pre-processing  and  statistical  analysis  of  the  raw  data  we 
obtained  a  list  of  -535  Myc  bound  spots  (Supplementary  Table  1).  Importantly,  35  of 
these  535  spots  correspond  to  known  Myc  targets  annotated  in  the  Myc  target  gene 
database  [36,  37].  To  validate  some  of  the  novel  target  genes  we  performed  Q-PCR  from 
4  conventional  Myc  ChIP  biological  replicates  performed  in  HL60  cells.  Importantly, 
these  reactions  used  biologically  independent  samples  that  were  completely  separate  from 
any  of  the  samples  used  in  the  optimization  studies  or  optimized  array  work.  As  a 
positive  control  we  show  that  Myc  is  bound  to  the  Cyclin  D2  promoter  and  as  a  negative 
control  that  it  is  not  bound  to  the  chromosome  21  E-Box  (Fig.  6).  We  designed  the  PCR 
primers  against  the  promoter  regions  of  five  spots  taken  from  the  top  150  spots,  ranked 
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by  p- value,  on  the  list.  Four  of  these  five  ( GDAP2 ,  RPL34,  PHLPP  and  TCERG1)  were 
confirmed  as  being  bound  by  Myc  in  growing  HL60  cells  (Fig.  6).  Following  the  same 
criteria,  we  designed  primers  against  the  promoter  regions  of  four  spots  taken  from  the 
middle  third  of  the  list,  and  all  four  ( CDC14A ,  CDCA2  and  NOL6)  were  confirmed  as 
being  bound  by  Myc  (Fig.  6).  Finally,  we  designed  primers  against  the  promoter  regions 
of  four  spots  from  the  bottom  third  of  the  list,  and  three  of  these  four  ( PLTP ,  KLC4  and 
GAS1 )  were  bound  by  Myc  (Fig.  6).  Thus,  with  our  well  established  ChIP-chip 
experimental  parameters  we  efficiently  identified  multiple  novel  bona  fide  Myc  target 
genes.  Importantly,  high  validation  rates  were  observed  across  a  wide  range  of  p-values, 
suggesting  that  our  analysis  is  conservative,  and  that  additional  Myc  targets  could  be 
identified  by  relaxing  the  p-value  threshold  without  significantly  compromising 
specificity. 
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DISCUSSION 

ChIP-chip  is  a  relatively  new  technological  advance  that  has  quickly  become  the 
gold  standard  for  screening  protein-DNA  interactions  in  vivo.  However,  it  is  also  a  highly 
complex  method,  with  many  tunable  parameters.  We  set  out  to  systematically  test  a  large 
subset  of  these  experimental  design  parameters.  To  do  so,  we  employed  a  well- 
characterized  system  involving  the  Myc  oncogene  as  a  regulator  of  gene  transcription  in 
the  HL60  human  myeloid  leukemia  cells  [12].  While  this  means  that  our  results  are 
formally  restricted  to  a  single  biological  system,  the  general  nature  of  the  parameters 
evaluated  indicates  a  much  broader  relevance  to  ChIP-chip  studies  across  many  systems. 

The  first  experimental  design  parameter  considered  was  the  influence  of  different 
antibodies  on  the  results  of  a  ChIP-chip  study.  Intuitively,  the  nature  of  the  antibody  is  a 
limiting  step  for  ChIP-chip  studies.  To  put  a  lower-limit  on  antibody-specific  variability, 
we  selected  two  polyclonal  antibodies  raised  against  the  same  portion  of  the  Myc  protein 
(the  N-terminal  262  residues,  N262)  that  were  derived  from  different  sources.  Critically, 
the  two  antibodies  showed  a  broadly  linear,  well-ordered  trend  (p  <  2.2  x  10  I6;  Wilcoxon 
Rank  Test).  However,  the  significance  of  differential  hybridization  varied  substantially, 
with  the  commercially- available  antibody  being  much  more  sensitive  (Fig.  2A).  We  then 
subjected  the  locally-generated  N262  antibody  to  a  column  purification  procedure  to 
remove  contaminating  serum  proteins.  The  performance  of  this  purified  antibody  is 
significantly  improved  compared  to  the  unpurified  antibody  (Fig.  2B).  Clearly,  antibody 
selection  is  a  critical  element  of  ChIP-chip  studies,  and  antibody  specific  activity  is  an 
important  parameter  to  consider  prior  to  hybridization. 

Next  we  considered  the  effects  of  inter-batch  variability  of  the  CpG  island 
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microarrays.  This  array  platform  has  been  widely  used  by  other  groups  [26,  35,  38], 
giving  confidence  in  its  reliability.  We  tested  the  variability  between  two  batches  of 
arrays  in  a  multi-factorial  experiment.  A  consistent  signal-intensity  bias  was  observed, 
with  the  older  array  batch  showing  reduced  signal  relative  to  the  newer  batch  (Fig.  3). 
This  bias  is  highly  reproducible,  as  it  was  evident  in  an  analysis  of  24  arrays  from  each 
batch.  Bias  of  this  magnitude  is  sufficient  to  create  both  false  positives  and  false- 
negatives.  Fortunately,  there  are  straight-forward  methods  for  controlling  this  factor. 

First,  whenever  possible,  arrays  from  a  single  batch  should  be  used.  Second,  when 
multiple  batches  are  used,  samples  should  be  randomized  or  blocked  across  batches. 
Third,  if  linear-modeling  methods  are  employed,  explicit  batch  terms  should  be  included 
in  model-specification  to  remove  this  variability.  The  first  two  methods  are  standard 
experimental-design  practices,  while  the  third  has  been  successfully  used  in  the  analysis 
of  mRNA  expression  array  data  [39]. 

Our  third  question  involves  the  effects  of  reciprocal-labeling  (dye-swapping)  a 
single  sample  with  the  Cy3  and  Cy5  dyes.  This  topic  has  been  extensively  studied  from 
both  theoretical  [40,  41],  and  empirical  [42]  perspectives.  Even  if  reciprocal  labeling  has 
been  shown  to  have  less  experimental  value  than  true  biological  replicates,  they  appear  to 
carry  more  information  than  simple  repeated  measures  [42],  To  our  surprise,  very 
moderate  effects  of  reciprocal  labeling  were  observed,  with  only  16  out  of  the  >12,000 
spots  on  the  array  showing  a  two-fold  change  in  signal  intensity  based  on  dye-status  (Fig. 
3).  This  indicates  that,  at  least  for  ChIP-chip  with  our  CpG  island  arrays,  reciprocal 
labeling  is  not  necessary  and  dye-bias  is  a  negligible  confounder.  Nevertheless,  it  seems 
prudent  to  include  dye-status  as  a  randomization  factor  in  experimental  design  in  the  case 
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of  unanticipated  confounding  dye-bias  in  specific  experimental  scenarios,  especially 
because  of  the  profound  asymmetry  in  this  parameter.  Instead  of  dye-swapping  a  single 
sample  we  recommend  simply  balancing  dye  labels  across  experimental  groups. 

The  fourth  experimental  parameter  considered  was  the  effect  of  hybridization- 
date.  Because  our  experiment  was  saturated  and  fully  blocked  across  all  factors,  we  were 
able  to  directly  estimate  the  effects  of  this  parameter  with  a  linear  model.  We  were 
surprised  to  find  that  inter-day  hybridization  had  more  influence  than  either  batch-effect 
or  dye-swap  status  (Fig.  3).  The  sources  of  this  bias  are  unclear.  The  symmetrical 
distribution  of  the  effect  suggests  that  neither  ozone-status  nor  scanner  settings  are  a 
major  factor.  Perhaps  use  of  a  mechanical  hybridization  and  wash  station  would  decrease 
inter-day  variability.  Control  of  this  factor  is  straight-forward:  samples  should  be 
randomized  across  experimental  day,  so  that  biological  replicates  of  a  given  condition  are 
hybridized  on  different  days. 

The  fifth  experimental  parameter  considered  was  the  method  of  DNA 
amplification.  While  the  antibody-based  ChIP  protocol  imparts  great  specificity  and 
sensitivity  to  ChIP-chip  experiments,  most  ChIP  protocols  result  in  a  relatively  small 
amount  of  DNA.  Because  the  quantity  of  DNA  necessary  for  hybridization  cannot 
usually  be  obtained  in  a  single  ChIP,  two  basic  strategies  have  evolved:  pooling  of 
multiple  ChIPs  [12]  and  amplification  of  ChIP  DNA  [25].  A  variety  of  different  PCR- 
based  amplification  methods  have  been  developed  and  are  in  use  by  different  groups. 
Assessing  the  performance  of  these  methods  has  been  a  challenging  problem.  To  directly 
compare  an  amplified  ChIP  sample  to  an  unamplified  ChIP  sample  requires  the  pooling 
of  a  large  number  of  ChIPs  for  each  replicate,  resulting  in  significant  costs,  time  and 
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labor.  Further,  care  must  be  taken  in  the  experimental  design  to  ensure  that  the  reduction 
in  variance  that  results  from  pooling  ChIP  reactions  does  not  confound  the  effects  of  the 
amplification  procedure.  As  a  result  of  these  financial  and  technical  factors,  we  are  not 
aware  of  any  detailed,  replicate  studies  measuring  the  performance  of  amplification 
methods  for  ChIP-chip  data,  although  such  studies  do  exist  for  mRNA  experiments  [43, 
44],  where  a  much  larger  amount  of  starting  material  is  readily  available.  To  resolve  this 
issue,  we  have  developed  a  novel  approach  that  hybridizes  amplified  against  non 
amplified  genomic  DNA  (Fig.  4A).  We  evaluated  three  amplification  protocols:  RP-PCR, 
or  one  of  the  two  protocols  of  LM-PCR.  The  matched  amplified  and  unamplified  sample 
pairs  were  then  hybridized  to  12k  CpG  island  microarrays  in  equal  quantities.  Because  of 
its  simplicity,  this  procedure  could  be  readily  repeated  and  an  n=4  was  generated  for  each 
amplification  procedure.  The  number  of  spots  found  to  be  differentially  hybridized 
between  the  matched  amplified  and  unamplified  samples  is  then  taken  as  a  measure  of 
amplification  bias,  and  is  plotted  for  a  range  of  p-values  (Fig.  4B).  Interestingly,  the 
performance  of  random-priming  falls  in  between  that  of  the  two  LM-PCR  protocols, 
indicating  that  specific  parameter  choices  within  a  protocol  are  important  for  determining 
overall  performance.  Our  method  of  comparing  amplification  procedures  provides  a  new 
way  to  measure  variability,  independent  of  antibody  or  biological  question,  at  a 
reasonable  cost. 

The  final  experimental  design  parameter  considered  was  the  selection  of  a 
negative  control  for  hybridization  to  the  array.  We  considered  three  approaches  (Fig. 

5A),  including  two  direct  and  one  indirect  method.  For  each  method  we  performed  four 
Myc  ChIP-chip  experiments  in  HL60  cells.  When  we  compared  the  sensitivity  of  these 
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methods  (Fig.  5B)  the  two  direct  methods  performed  comparably,  with  no  significant 
differences  between  the  IgG  and  no-antibody  controls.  The  indirect  design,  however,  did 
show  increased  variability,  with  reduced  sensitivity  at  any  given  p-value.  Thus,  direct 
designs,  which  use  half  as  many  arrays  as  indirect  designs,  are  favored  for  both  cost  and 
sensitivity  reasons. 

Finally,  we  set  up  a  Myc  ChIP-chip  experiment  to  test  these  optimized  parameters 
in  an  integrated  way.  Following  extensive  biological  replication  (n=13)  we  validated 
putative  Myc  target  genes  by  Q-PCR  from  conventional  ChIP  reactions.  Importantly,  our 
validation  used  biologically  independent  samples  to  ensure  maximal  generalization  of  the 
results.  We  validated  4  of  5  genes  from  the  top  third  of  the  list  ( GDAP2 ,  RPL34,  PHLPP 
and  TCERG1 ),  the  4  of  4  genes  taken  from  the  middle  third  of  the  list  ( CDC14A ,  CDCA2 
and  NOL6),  and  3  of  4  genes  taken  from  the  bottom  third  of  the  list  ( PLTP ,  KLC4  and 
GAS1)  (Fig.  6).  One  reason  of  getting  false-positives  in  the  ChIP-chip  experiment  could 
be  the  non-specific  DNA-binding  property  of  Myc  [45].  This  feature  is  most  likely  the 
reason  of  why  some  targets  are  identified  as  potential  Myc-binding  sites,  but  are  not 
validated  upon  further  testing.  In  total,  then,  we  have  identified  535  novel  putative  Myc 
target  genes,  of  which  1 1  have  been  validated  by  gold-standard  Q-PCR.  In  addition,  35 
spots  out  of  the  535  spots  represented  genes  whose  expression  is  known  to  be  regulated 
by  Myc.  Moreover,  85  spots  corresponded  to  genes  that  belong  to  a  family  in  which 
another  member  has  been  described  as  a  Myc-regulated  gene  [37],  Five  of  these  have 
been  tested  and  validated  ( GDAP2 ,  RPL34,  CDC14A,  CDCA2  and  NOL6 )  suggesting  the 
remaining  80  targets  are  likely  to  also  be  Myc  target  genes.  Importantly,  we  showed  that 
the  validation  rate  remained  high  in  the  bottom  portion  of  our  list.  This  suggests  that  the 
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statistical  analysis  was  conservative,  and  that  additional  hits  might  be  uncovered  by 
relaxing  our  p-value  threshold  from  0.05  to  0.10  or  higher.  This  is  particularly  important 
because  many  published  ChIP-chip  studies  use  a  p-value  threshold  of  0.001,  50  times 
more  stringent  than  used  here  [11].  Overall,  validation  of  the  Myc  target  genes  identified 
in  this  study  confirms  the  success  of  our  parameter  optimizations. 

Over  the  past  decade,  a  series  of  studies  from  multiple  groups  served  to 
characterize  the  major  experimental  design  parameters  for  mRNA  expression  studies  [28, 
42,  46-48].  A  common  theme  of  these  studies  is  that  it  is  necessary  to  use  both  empirical 
and  statistical  methodologies  to  optimize  the  experimental  design.  We  have  provided  the 
first  comprehensive  overview  of  experimental  design  parameters  for  ChIP-chip  data, 
testing  both  parameters  related  to  all  microarray  experiments  and  those  unique  to  ChIP- 
chip  studies.  We  find  good  concordance  of  ChIP-chip  and  mRNA  expression  design 
characteristics.  For  example,  in  both  cases,  indirect  hybridization  designs  appear  to  be 
more  costly  without  providing  a  significant  advantage  over  simpler  direct  designs. 
Similarly,  we  have  shown  that  biological  replicates  are  more  important  than  dye-swaps, 
just  as  identified  for  mRNA  expression  arrays  [42],  We  also  demonstrated  that  antibody 
characteristics  and  amplification  methods  are  major  sources  of  variability  in  ChIP-chip 
studies.  For  example,  using  an  antibody  of  maximal  specific  activity  appears  to  be 
critical  for  increasing  the  sensitivity  of  ChIP-chip  studies.  In  addition,  we  introduced  a 
novel  method  for  testing  amplification  bias  in  an  antibody-independent  manner.  This 
study  indicated  that  amplification  procedures  can  significantly  bias  ChIP-chip  results,  and 
provides  a  simple,  efficient  method  for  benchmarking  the  bias  of  specific  amplification 
protocols. 
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Our  findings  also  raise  a  number  of  intriguing  questions  surrounding  the  antibody 
and  amplification  results.  We  demonstrate  that  antibody-specific  baseline  variability 
exists  in  ChIP-chip  studies,  and  that  while  it  can  be  mitigated  it  cannot  be  completely 
removed.  It  would  be  of  great  interest  to  replicate  these  studies  with  monoclonal 
antibodies,  or  polyclonal  antibodies  raised  against  different  regions  of  a  protein.  In  this 
manner,  it  might  be  possible  to  identify  characteristics  that  predict  which  antibodies  will 
be  most  sensitive  or  specific  in  a  ChIP-chip  study.  Alternatively,  this  may  not  be  a 
generalizable  parameter  and  may  be  protein-specific.  Similarly,  we  have  presented  a 
novel  technique  for  studying  amplification  bias,  and  identified  a  variant  of  ligation- 
mediated  PCR  as  most  favorable  in  this  regard.  Nevertheless,  all  amplification  protocols 
tested  here  introduced  a  significant  amount  of  bias.  Our  strategy  for  bias-identification 
should  prove  useful  in  the  development  of  reduced-bias  amplification  protocols. 

As  ChIP-chip  studies  continue  to  broaden  in  the  scope  and  detail  of  the  biological 
questions  they  probe,  the  application  of  robust  and  validated  experimental  designs  will  be 
a  significant  advantage  to  the  field. 
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CONCLUSIONS 

We  performed  a  series  of  empirical  studies  to  systematically  evaluate  the  effects 
of  key  design  parameters  for  ChlP-chip  studies.  Antibody  purity  is  an  important  factor  for 
controlling  the  specificity  and  sensitivity  of  ChlP-chip  experiments,  thus,  antibody 
purification  should  be  routinely  considered  to  improve  assay  performance.  The  use  of 
multiple  array  batches  increased  bias,  with  older  arrays  showing  lower  signal  intensity 
than  newer  arrays,  indicating  that  either  a  single  batch  of  arrays  be  used,  or  experimental 
samples  be  randomized  across  batches.  Minimal  dye  bias  was  observed  in  our  study, 
although  prudent  experimental  design  would  include  dye  swap  as  a  randomization  factor 
by  balancing  dye  labels  across  experimental  groups  to  guard  against  unexpected  biases. 
The  date  of  hybridization  was  a  major  source  of  variability,  but  the  source  of  this  factor 
remains  unclear.  Evidently,  samples  should  be  randomized  across  experimental  days. 
Three  DNA  amplification  methods  were  evaluated  by  a  novel  approach  that  hybridizes 
amplified  against  non-amplified  genomic  DNA.  This  is  a  simple  way  to  evaluate  the  bias 
of  an  amplification  protocol,  independent  of  the  biological  question.  We  found  that 
ligation-mediated  PCR  introduced  less  bias  than  random-priming  PCR,  although  bias  was 
found  to  be  sensitivity  to  parameterization.  Finally,  we  found  that  direct  hybridization 
controls  reduced  variability  and  increased  sensitivity  relative  to  indirect  controls.  Thus, 
direct  designs  are  preferential  for  both  sensitivity  and  financial  reasons.  To  integrate 
these  findings,  we  tested  the  optimized  parameters  in  a  Myc  ChlP-chip  experiment  in 
HL60  cells.  High  validation  rates  observed  by  Q-PCR  in  independent  samples  show  the 
sensitivity  and  specificity  of  our  optimized  assay.  This  systematic  and  thorough 
optimization  study  is  a  key  step  towards  putting  our  understanding  of  ChlP-chip 
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experimental  parameters  on  an  equal  footing  with  our  understanding  of  mRNA 
expression  arrays,  and  will  enable  improved  elucidation  of  gene-regulatory  networks. 
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MATERIALS  AND  METHODS 

Antibody  production  and  purification 

The  DNA  fragment  corresponding  to  the  Myc  1-262  N-terminal  domain  polypeptide  was 
cloned  into  pET15b  vector  (Novagen  69661-3)  at  5’-NdeI  and  3’-BamHI  sites.  His-c- 
Myc  (1-262)  fusion  protein  was  purified  under  denatured  conditions  using  Talon  beads 
(BD  Biosciences)  according  to  the  manufacturer’s  protocol.  The  purified  protein  was 
concentrated  to  2.8  mg/ml  before  injecting  into  the  rabbit.  The  purified  protein  was  then 
dissolved  in  a  volume  of  500  pL  of  PBS  per  rabbit.  Primary  immunizations  were  done 
with  500  pg  of  purified  protein  per  rabbit.  First  boost  was  done  with  250  pg  of  purified 
protein.  Two  subsequent  boosts  were  done  with  50  pg  of  purified  protein.  The  final  bleed 
was  then  purified  using  Protein  A,  IgG  Purification  (Pall  PN5300-IgG  proA)  according  to 
manufacturer’s  protocol. 

Cell  Culture  conditions 

HL60  (ATCC)  cells  were  maintained  in  a-MEM  with  10%  FBS. 

Chromatin  immunoprecipitation 

Exponentially  growing  HL60  cells  were  cross-linked  with  1%  formaldehyde  for  10 
minutes  at  37  °C.  The  crosslinking  reaction  was  quenched  by  addition  of  glycine  to  a 
final  concentration  of  0.125  M  for  5  minutes,  followed  by  two  washes  with  phosphate- 
buffered  saline  (PBS).  Cells  were  resuspended  in  cell  lysis  buffer  (5  mM  PIPES  pH  8,  85 
mM  KC1,  0.5%  [v/v]  NP40,  1  mM  PMSF,  10  pg/mL  aprotinin,  10  pg/mLleupeptin)  for 
10  minutes  on  ice  and  then  pelleted  (5000  rpm,  5  minutes,  4  °C).  The  pellet  was 
resuspended  in  1  mL  of  nuclei  lysis  buffer  (50mM  Tris-HCl  pH  8.1,  lOmM  EDTA,  1% 
SDS,  1  mM  PMSF,  10  pg/mL  aprotinin,  10  pg/mL  leupeptin)  for  10  minutes  on  ice  and 
then  sonicated  using  8  pulses  (12-13  Watts,  setting  10,  10  seconds  per  pulse,  45  seconds 
on  ice  between  pulses)  from  a  Model  60  Sonic  Dismembrator  (Fisher  Scientific  15-338- 
53)  to  generate  fragments  between  600  bp  and  1000  bp.  Lysates  were  centrifuged  for  10 
minutes  at  21,000  x  g  at  4  °C.  Supernatants  were  diluted  into  an  equal  volume  with  IP 
dilution  buffer  (0.01%  SDS,  1.1%  Triton-XlOO,  1.2mN  EDTA,  16.7mM  Tris-HCl  pH 
8.1,  0.2%  Sarkosyl,  1  mM  PMSF,  10  pg/mL  aprotinin,  10  pg/mL  leupeptin)  and  pre¬ 
cleared  for  30  minutes  at  4  °C  with  protein  G-PLUS  agarose  beads  (Santa  Cruz 
Biotechnology  sc-2002).  Prior  to  use,  G-PLUS  agarose  beads  were  blocked  with  salmon 
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sperm  DNA  at  a  final  concentration  of  50  pg/mL  and  rotated  overnight  at  4  °C.  Diluted 
and  cleared  extracts  corresponding  to  10  x  106  HL60  cells  were  incubated  and  rotated  at 
4°C  for  approximately  12  to  16  hours  with  each  of  the  following  antibodies:  no-antibody 
control,  0.7  pg  normal  rabbit  IgG  (Santa  Cruz  Biotechnology  sc-2027),  0.7  pg  N262 
(Santa  Cruz  Biotechnology  sc-764),  for  the  N262  home-made  unpurified  and  purified 
antibodies,  we  determined  empirically,  by  serial  dilutions,  the  amount  of  antibody  to  be 
used.  50  pL  of  salmon  sperm  DNA  pre-blocked  Protein  G-PLUS  agarose  beads  were 
added  to  each  sample,  incubated  on  a  rotating  platform  at  4  °C  for  3  hours.  Each  pellet 
was  washed  once  with  1.4  mL  of  sonication  buffer  and  then  twice  with  1.4  mL  of  high 
salt  buffer  (0.1%  [v/v]  SDS,  1%  [v/v]  Triton  X-100,  1  mM  EDTA,  50  mM  HEPES,  500 
mM  NaCl,  0.1%  [w/v]  sodium  deoxycholate)  and  then  once  with  1.4  mL  LiCl  Buffer 
(250  mM  LiCl,  1%  [v/v]  NP-40,  1%  [w/v]  sodium  deoxycholate,  1  mM  EDTA,  1  mM 
Tris  pH  8)  and  finally  twice  with  1.4  mL  TE  pH  8  (10  mM  Tris  pH8,  1  mM  EDTA).  For 
each  wash,  the  pellets  were  mixed  for  5  minutes  at  room  temperature  then  pelleted  (3000 
rpm,  30  seconds,  room  temperature).  After  the  last  wash,  the  pellets  were  eluted  in  300 
pL  of  Elution  buffer  (1%  [w/v]  SDS,  10  mM  Tris  pH  8,  5  mM  EDTA),  incubated  at  65 
°C  for  15  minutes,  and  then  pelleted  (3000  rpm,  3  minutes,  room  temperature).  Cross¬ 
links  were  reversed  in  the  presence  of  200  mM  NaCl  at  65  °C  over-night  and  samples 
were  treated  with  RNase  A  (Sigma  R5500).  After  ethanol  precipitation,  the  samples  were 
resuspended  in  100  pL  of  TE  (10  mM  Tris,  pH  7.5,  1  mM  EDTA),  25  pL  of  5x 
proteinase  K  buffer  (1.25%  SDS,  50  mM  Tris,  pH  7.5,  25  mM  EDTA),  and  1.5  pL  of 
proteinase  K  (Roche  1413783)  and  incubated  at  42  °C  for  2  hours.  DNA  was  extracted 
from  each  sample  using  phenol:chloroform:isoamyl  alcohol  (25:24:1),  then  precipitated 
with  l/10th  volume  of  3  M  sodium  acetate  (pH  5.3),  5  pg  of  glycogen,  and  2  volumes  of 
ethanol  at  -20  °C  overnight.  Pellets  were  collected  by  microcentrifugation  and 
resuspended  in  60  pL  of  H20. 

PCR 

A  total  of  2  pL  of  the  ChIP  DNA  was  PCR  amplified  using  promoter-specific  primers. 
All  PCR  primers  (synthesized  by  Sigma  Genosys  or  Invitrogen)  were  resuspended  in 
sterile  RNase-/DNase-free  water.  In  general,  each  10  pL  PCR  reaction,  set  up  at  room 
temperature,  was  composed  of  10  ng/pL  of  each  primer  (fwd/rev),  lx  PCR  buffer,  0.2 
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mM  dNTP  mix,  sterile  water,  DNA  template,  and  0.2  pL  HotstarTaq  DNA  polymerase 
(Qiagen  203205);  the  HotstarTaq  was  activated  by  initiating  the  PCR  program  at  95°C 
for  15  minutes.  Amplification  of  each  PCR  primer  set  was  performed  using  multiple 
annealing  temperatures  along  a  52  °C  to  68  °C  gradient  for  30  seconds.  Primer  sequences 
are  provided  in  Supplementary  Table  2.  All  PCR  reactions  were  electrophoresed  on  100 
mL  1.2%  agarose  gels  containing  4  pL  of  10  mg/mL  EtBr,  and  then  visualized  by  UV 
florescence. 

Real-time  quantitative  PCR  (Q-PCR) 

Real-time  quantitative  PCR  amplification  was  conducted  using  the  SYBR  Green  assay  in 
the  ABI  PRISM  7900-HT  (Applied  Biosystems).  Each  12  pL  quantitative  PCR  reaction 
was  composed  of  the  following:  2  pL  of  the  DNA  isolated  in  the  ChIP,  lx  PCR  Buffer, 
2.5  mM  MgCh,  0.17  mM  dNTP  mix,  sterile  water,  0.25x  SYBR  Green  (Sigma  S9430), 
0.2  pL  Rox  reference  dye  (Invitrogen  12223-012)  and  0.05  pL  Platinum  Taq  DNA 
polymerase  (Invitrogen  10966-034)  and  was  performed  in  triplicate  in  a  3 84- well  plate. 
The  reactions  began  at  50  °C  for  2  minutes  and  then  were  activated  at  95  °C  for  10 
minutes  followed  by  40  cycles  of  95  °C  for  15  seconds  and  60  °C  for  1  minute.  Human 
male  genomic  DNA  (Novagen  70572)  was  used  as  standard  real-time  quantitative  PCR 
(Q-PCR)  was  conducted  in  triplicates  for  each  of  the  four  independent  biological 
replicates.  The  Q-PCR  data  was  analyzed  by  calculating  the  Myc/IgG  ratios  for  each 
target  gene,  and  the  Chr21  negative  control.  A  symmetrical  distribution  was  then 
obtained  by  log2-transformation  of  each  individual  ratio.  Statistical  testing  involved 
comparison  of  the  ratios  for  each  target  gene  to  the  ratios  of  the  Chr21  negative  control 
using  a  two-tailed  paired  t-test.  Primer  sequences  are  provided  in  Supplementary  Table  3. 
The  primers  were  designed  using  Primer  Express  Software  Version  2.0  according  to  the 
manufacturer  procedures  (Applied  Biosciences). 

Random  primer  PCR  amplification  method 

The  immunoprecipitated  DNA  was  amplified  as  described  elsewhere  [49]  with  the 
following  modifications:  15  pL  of  Round  A  are  mixed  with  Round  B1  Reaction  Mix  to  a 
final  volume  of  100  pL  [4pL  of  50  mM  MgCh,  10  pL  of  lOx  PCR  Buffer,  2  pL  of  lOmM 
dNTPs,  2.5  |lL  of  100  |lM  of  Primer  B  (5’GTTTCCCAGTCACGATC3’),  1  pL  of  Taq 
DNA  Polymerase  (Promega  M1861)].  Amplification/Nucleotide  Incorporation  Program: 
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(94  °C  for  30  seconds,  40  °C  for  30  seconds,  50  °C  for  30  seconds,  72  °C  for  1  minute)  x 
20  cycles,  72  °C  for  2  minutes. 

15  pL  of  Round  B1  were  mixed  with  Round  B2  Reaction  Mix  to  a  final  volume  of  50  pL 
[2  pL  of  50  mM  MgCl2,  5  pL  of  10X  PCR  Buffer,  0.5  pi  of  dNTP  Mix  B2  (25mM  of 
dATP,  dCTP,  dCTP,  12.5  mM  dTTP),  1.5  pL  of  2  mM  aa-dUTP,  1  pL  of  100  (iM  of 
Primer  B  (5’GTTTCCCAGTCACGATC3’),  1  pL  of  Taq  DNA  Polymerase  (Promega 
M1861)].  Amplification/Nucleotide  Incorporation  Program:  (92  °C  for  30  seconds,  40  °C 
for  30  seconds,  50  °C  for  30  seconds,  72  °C  for  1  minute)  x  15  cycles,  72  °C  for  2 
minutes.  Round  B2  amplified  DNA  was  concentrated  using  Microcon  columns 
(Millipore,  Microcon  YM-30).  Once  the  ChIP  DNA  was  amplified  to  3  pg,  PCR 
reactions  were  performed  with  10  ng  amplified  DNA  and  were  identical  to  the  reactions 
performed  after  the  ChIP  DNA  was  initially  isolated.  This  PCR  was  used  to  determine 
whether  specificity  between  the  IgG  or  NoAb  and  antibody  samples  was  maintained  after 
the  amplification. 

Ligation-mediated  PCR  (LM-PCR)  Amplification 

The  DNA  from  each  ChIP  reaction  was  treated  as  described  elsewhere  [25],  The  PCR 
was  performed  using  the  following  cycling  protocol:  1  cycle  of  55  °C  for  2  minutes,  72 
°C  for  5  minutes,  and  95  °C  for  2  minutes  and  the  corresponding  number  of  cycles  for 
LM-PCR  A  and  LM-PCR  B  (see  below  LM-PCR  A  and  B  cycle  conditions)  at  95  °C  for 
30  seconds,  55  °C  for  30  seconds,  and  72  °C  for  1  minute,  ending  with  4  minutes  at  72  °C 
and  then  an  endless  cooling  at  4  °C.  After  the  PCR,  the  DNA  was  again  isolated  using  the 
QIAquick  PCR  purification  kit  (Qiagen  28106).  The  concentration  of  the  purified  DNA 
was  determined  on  the  NanoDrop  ND-1000  spectrophotometer.  Often  the  3  pg  required 
for  hybridization  was  not  acquired  after  this  first  round  of  amplification,  so  another  round 
of  amplification  would  be  performed,  followed  by  purification  and  determination  of  the 
DNA  concentration  as  previously  described.  LM-PCR  A  amplification:  20  cycles 
followed  by  QIAquick  PCR  purification,  and  then  15  cycles  followed  by  QIAquick  PCR 
purification.  LM-PCR  B  amplification:  20  cycles  followed  by  QIAquick  PCR 
purification,  repeated  3  rounds,  sequentially.  Once  the  ChIP  DNA  was  amplified  to  3  pg, 
PCR  reactions  of  positive  and  negative  control  genes  (Supplementary  Table  2)  were 
performed  with  10  ng  amplified  DNA  and  were  identical  to  the  reactions  performed  after 
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the  ChIP  DNA  was  initially  isolated.  This  PCR  was  necessary  to  determine  whether 
specificity  between  the  IgG  or  NoAb  and  antibody  samples  was  maintained  after  the 
amplification. 

Labeling  reaction 

A  total  of  3  (ig  of  amplified  DNA  from  the  antibody  sample,  no  antibody,  or  IgG  sample 
were  vacuum  desiccated  and  resuspended  in  2.5  |lL  of  H20  (Sigma  W4502).  4.5  (tL  of 
0.1  M  NaHCOs  at  pH  9  was  added.  The  DNA  was  resuspended  by  vortexing  several 
times.  2  |aL  of  Cy3  or  Cy5  dye  (Amersham,  PA23001  and  PA25001  respectively)  were 
added  to  the  sample.  The  mix  was  incubated  for  1.5  hours  at  room  temperature  in  the 
dark.  After  incubation,  35  (tL  of  100  mM  Na- Acetate  pH5.2  was  added  and  topped  with 
H20  (Sigma  W4502)  to  a  total  volume  of  100  (tL  for  each  sample.  The  fluorescent 
labeled  probes  were  purified  using  QIAquick  PCR  Purification  kit  (Qiagen  28106).  The 
DNA  was  eluted  with  50  (tL  of  EB  buffer  from  the  kit,  heated  to  65°C.  The  100  (tL 
combined  ChIP  and  no  antibody  samples  were  vacuum  desiccated  using  a  SpeedVac  for 
60  minutes  at  maximum  heat. 

DNA  Microarray  Hybridization 

The  mixed  DNA  was  resuspended  in  5  (iL  of  H?0  and  mixed  with  85  (iL  hybridization 
mix  (100  (tL  of  DIG  Easy  Hyb  solution  (Roche  603  558),  5  (iL  of  10  mg/mL  calf  thymus 
DNA  (Sigma  D8661),  5  (tL  of  10  mg/mL  yeast  tRNA  (Invitrogen  15401-029)).  The 
resulting  solution  was  incubated  at  65  °C  for  2  minutes,  then  cooled  to  room  temperature 
and  applied  to  a  CpG  island  array  slide  (UHN  Microarray  Centre  HCGI12K)  and 
incubated  overnight  at  42  °C  in  a  hybridization  chamber. 

Microarray  washes 

After  hybridization,  the  microarray  slide  was  washed  once  with  washing  buffer  1  (lx 
SSC,  0.1%  SDS)  at  50°C  for  5  minutes.  This  was  followed  by  a  wash  with  washing 
buffer  1  (room  temperature  for  5  minutes)  and  2  washes  with  washing  buffer  2  (0.2x  SSC 
for  5  minutes)  and  one  final  wash  with  washing  buffer  3  (O.lx  SSC  for  5  minutes  at  room 
temperature).  The  slide  was  then  dried  by  spinning  at  700  rpm  for  15  minutes.  To 
preserve  the  Cy5  dye  from  ozone-dependent  degradation,  which  was  observed  during  the 
summertime  when  the  hybridization  occurred,  the  slides  were  then  dipped  in  DyeSaver  2 
(Genisphere  Q500600)  and  allowed  to  air  dry. 
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Microarray  Scanning 

The  microarray  slides  were  scanned  using  a  Gene  Pix  4000B  scanner  (Axon  Instruments) 
at  multiple  PMT  voltages. 

Microarray  Data  Pre-Processing 

Array  images  were  first  manually  examined  for  image  artifacts,  and  then  quantitated 
using  GenePix  Pro  (v6.0.1.27).  A  scan  that  maximized  the  dynamic  range  without 
saturating  high-intensity  signals  was  selected  and  carried  forward  for  analysis.  This  scan 
was  quantified  using  the  “circular”  segmentation  algorithm,  then  pre-processed  using  the 
variance-stabilizing  normalization  algorithm  (VSN).  VSN  was  implemented  in  the  R 
statistical  environment  (v2.4.1)  in  the  BioConductor  open-source  project  [50].  Version 
1.12.0  of  the  vsn  package  was  employed,  with  print-tip  groups  used  as  strata,  and  default 
parameterization  except  for  using  1000  iterations  for  fitting  (increased  from  the  default  of 
10).  All  raw  and  processed  array  data  will  be  made  publicly  available  in  the  Gene 
Expression  Omnibus  (GEO)  repository  at  NCBI  -  an  accession  will  be  available  prior  to 
publication. 

Significance  Testing 

To  test  for  differential  hybridization,  we  exploited  general  linear  modeling,  as 
implemented  in  the  limma  package  (v2.9.13)  [51]  of  the  BioConductor  open-source 
library  [50]  for  the  R  statistical  environment  (v2.4.1).  First,  a  spot-wise  linear  model  was 
fit  separately  to  each  major  design  parameters:  array  batch,  hybridization  date,  and  dye- 
status.  No  interaction  effects  were  included  due  to  lack  of  physical  motivation.  A 
Bayesian  moderation  of  standard  error  was  employed  following  model- fitting  [51].  A 
Gaussian  kernel  density  was  then  employed  on  the  effect  estimates  to  compare  the 
relative  magnitudes  of  these  design  parameters.  Next,  a  spot-wise  linear  model  was  fit 
comparing  the  hybridizations  using  a  commercially  available  antibody  to  those  using  a 
locally  produced  one.  The  top  100  spots  from  each  analysis  were  identified,  and  the  pair¬ 
wise  p-values  plotted  to  identify  trends  in  antibody  specificity.  A  similar  analysis  was 
performed  for  purified  and  non-purified  locally-produced  antibody  hybridizations.  All 
model-fitting  again  employed  the  limma  package  (v2.9.13).  To  compare  different 
amplification  methods  we  fit  separate  linear  models  to  the  amplified  vs.  non-amplified 
comparisons  for  each  amplification  method.  The  number  of  differentially  bound  spots 
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between  the  two  samples  was  calculated  across  the  entire  range  of  naive  p-value 
thresholds  from  0  to  1,  at  0.00001  unit  resolution.  A  similar  analysis  was  used  to 
investigate  hybridization  order,  except  that  the  models  were  fit  simultaneously  and  the 
effect  of  each  hybridization  order  was  extracted  separately  using  a  contrast  matrix.  An 
identical  tiling  across  p-values  was  used,  in  this  case  in  the  range  0  to  0.10  with  a 
resolution  of  0.0001.  These  analyses  were  performed  in  R  (v2.4.1).  All  gene-lists 
presented  are  based  on  a  p-value  threshold  of  p  <  0.05. 

Spot  Annotation 

The  published  array  annotations  for  the  12k  UHN  CpG  island  microarray  were  based  on 
the  UCSC  build  hgl7  of  the  human  genome  [30].  To  update  these  results  for  the  more 
polished  hgl8  build  we  implemented  a  new  annotation  algorithm  using  a  BioPerl  (vl.5.1) 
based  Perl  (v5.8.8)  script  [52],  First,  the  3’  and  5’  sequence-reads  were  overlapped  using 
bl2seq  [53]  with  a  word-size  of  7.  In  cases  where  a  minimum  50  base-pair  overlap  was 
found  the  aggregated  sequence  was  selected;  otherwise  the  longer  of  the  two  sequence- 
reads  was  chosen.  Next,  the  sequences  were  aligned  to  the  RepeatMasker  filtered 
chromosomal  sequence-reads  of  human  genome  build  hgl8  [54]  using  a  locally-compiled 
(gcc  v4.1.1;  AIX  5. 2. 0.0)  NCBI  BLASTN  executable  with  a  word-size  of  7  [55].  The 
best  matching  hit  was  selected  as  the  most  likely  region  of  hybridization,  and  the  ratio  of 
the  lengths  of  the  top  two  hits  was  used  as  a  measure  of  the  likelihood  of  cross¬ 
hybridization. 
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LIST  OF  ABBREVIATIONS  USED 

ChIP-chip:  Combination  of  Chromatin  Immunoprecipitation  with  high-throughput  DNA 

microarray  technology 

PCR:  Polymerase  chain  reaction 

Q-PCR:  Real  time  quantitative-PCR 

LM-PCR:  Ligation  mediated-PCR 

RP-PCR:  Random  primer-PCR 

Ab:  Antibody 

NoAb:  No  antibody 

IgG:  Immunoglobuline  G 


29 


SUBMITTED 


AUTHORS’  CONTRIBUTIONS 

R.P.,  P.C.B.  and  L.Z.P.  designed  and  initiated  the  project.  R.P.  performed  the  majority  of 
the  experiments,  including  the  target  gene  validations  of  Fig.  6.  P.C.B.  performed  all 
bio  informatics  analysis.  S.K.,  A.S.,  A.P.H.  and  F.K.  performed  the  ChIP-chip  reactions 
analyzed  in  Fig.  6.  All  authors  edited,  reviewed,  and  approved  the  final  manuscript. 


30 


SUBMITTED 


ACKNOWLEDGEMENTS 

The  authors  acknowledge  support  from  the  U.S.  Department  of  Defense  Breast  Cancer  Research  Program 
grant  BC032138  (LZP),  a  Knudson  Postdoctoral  Fellowship  (RP),  fellowships  from  the  PreCarn 
Foundation  (PCB),  the  Natural  Sciences  and  Engineering  Research  Council  (PCB),  and  the  CIHR 
EIRR21st  STIHR  (PCB),  the  Terry  Fox  Foundation  Fellowship  through  an  award  from  the  National  Cancer 
Institute  of  Canada  (SK),  Ontario  Graduate  Scholarship  (AS,  APH).  We  also  thank  the  Penn  Lab  for 
helpful  comments  during  the  course  of  these  experiments  and  in  reviewing  the  manuscript,  and  to  Mr. 
Richard  Lu  for  computer  system  support  and  administration. 


31 


SUBMITTED 


REFERENCES 


1.  Iyer  VR,  Horak  CE,  Scafe  CS,  Botstein  D,  Snyder  M,  Brown  PO:  Genomic 
binding  sites  of  the  yeast  cell-cycle  transcription  factors  SBF  and  MBF. 

Nature  2001,  409(68 19):533-538. 

2.  Lee  TI,  Rinaldi  NJ,  Robert  F,  Odom  DT,  Bar-Joseph  Z,  Gerber  GK,  Hannett  NM, 
Harbison  CT,  Thompson  CM,  Simon  I  et  ah  Transcriptional  regulatory 
networks  in  Saccharomyces  cerevisiae.  Science  2002,  298(5594):799-804. 

3.  Lieb  JD,  Liu  X,  Botstein  D,  Brown  PO:  Promoter-specific  binding  of  Rapl 
revealed  by  genome- wide  maps  of  protein-DNA  association.  Nat  Genet  2001, 
28(4):327-334. 

4.  Ren  B,  Robert  F,  Wyrick  JJ,  Aparicio  O,  Jennings  EG,  Simon  I,  Zeitlinger  J, 
Schreiber  J,  Hannett  N,  Kanin  E  et  ah  Genome- wide  location  and  function  of 
DNA  binding  proteins.  Science  2000,  290(5500) :2306-2309. 

5.  Bieda  M,  Xu  X,  Singer  MA,  Green  R,  Farnham  PJ:  Unbiased  location  analysis 
of  E2Fl-binding  sites  suggests  a  widespread  role  for  E2F1  in  the  human 
genome.  Genome  Res  2006,  16(5):595-605. 

6.  Carroll  JS,  Liu  XS,  Brodsky  AS,  Li  W,  Meyer  CA,  Szary  AJ,  Eeckhoute  J,  Shao 
W,  Hestermann  EV,  Geistlinger  TR  et  ah.  Chromosome- wide  mapping  of 
estrogen  receptor  binding  reveals  long-range  regulation  requiring  the 
forkhead  protein  FoxAl.  Cell  2005, 122(1  ):33-43. 

7.  Cheng  AS,  Jin  VX,  Fan  M,  Smith  LT,  Liyanarachchi  S,  Yan  PS,  Leu  YW,  Chan 
MW,  Plass  C,  Nephew  KP  et  ah  Combinatorial  analysis  of  transcription  factor 
partners  reveals  recruitment  of  c-MYC  to  estrogen  receptor-alpha 
responsive  promoters.  Mol  Cell  2006,  21(3):393-404. 

8.  Fernandez  PC,  Frank  SR,  Wang  L,  Schroeder  M,  Liu  S,  Greene  J,  Cocito  A, 
Amati  B:  Genomic  targets  of  the  human  c-Myc  protein.  Genes  Dev  2003, 

17(9):  11 15-1 129. 

9.  Horak  CE,  Mahajan  MC,  Luscombe  NM,  Gerstein  M,  Weissman  SM,  Snyder  M: 
GATA-1  binding  sites  mapped  in  the  beta-globin  locus  by  using  mammalian 
chip-chip  analysis.  Proc  Natl  Acad  Sci  USA  2002,  99(5):2924-2929. 

10.  Kirmizis  A,  Bartley  SM,  Kuzmichev  A,  Margueron  R,  Reinberg  D,  Green  R, 
Farnham  PJ:  Silencing  of  human  poly  comb  target  genes  is  associated  with 
methylation  of  histone  H3  Lys  27.  Genes  Dev  2004,  18(13):  1592- 1 605. 

11.  Li  Z,  Van  Calcar  S,  Qu  C,  Cave  nee  WK,  Zhang  MQ,  Ren  B:  A  global 
transcriptional  regulatory  role  for  c-Myc  in  Burkitt's  lymphoma  cells.  Proc 
Natl  Acad  Sci  U  S  A  2003,  100(  14):8 164-8 169. 

12.  Mao  DY,  Watson  JD,  Yan  PS,  Barsyte-Lovejoy  D,  Khosravi  F,  Wong  WW, 
Farnham  PJ,  Huang  TH,  Penn  LZ:  Analysis  of  Myc  bound  loci  identified  by 
CpG  island  arrays  shows  that  Max  is  essential  for  Myc-dependent 
repression.  Curr  Biol  2003,  13(10):882-886. 


32 


SUBMITTED 


13.  Pokholok  DK,  Zeitlinger  J,  Hannett  NM,  Reynolds  DB,  Young  RA:  Activated 
signal  transduction  kinases  frequently  occupy  target  genes.  Science  2006, 
313(5786):533-536. 

14.  Squazzo  SL,  O'Geen  H,  Komashko  VM,  Krig  SR,  Jin  VX,  Jang  SW,  Margueron 
R,  Reinberg  D,  Green  R,  Farnham  PJ:  Suzl2  binds  to  silenced  regions  of  the 
genome  in  a  cell-type-specific  manner.  Genome  Res  2006,  16(7):  890-900. 

15.  Weinmann  AS,  Yan  PS,  Oberley  MJ,  Huang  TH,  Farnham  PJ:  Isolating  human 
transcription  factor  targets  by  coupling  chromatin  immunoprecipitation  and 
CpG  island  microarray  analysis.  Genes  Dev  2002,  16(2):235-244. 

16.  Ishkanian  AS,  Malloff  CA,  Watson  SK,  DeLeeuw  RJ,  Chi  B,  Coe  BP,  Snijders  A, 
Albertson  DG,  Pinkel  D,  Marra  MA  et  ah  A  tiling  resolution  DNA  microarray 
with  complete  coverage  of  the  human  genome.  Nat  Genet  2004,  36(3):299-303. 

17.  Takayama  K,  Kaneshiro  K,  Tsutsumi  S,  Horie-Inoue  K,  Ikeda  K,  Urano  T,  Ijichi 
N,  Ouchi  Y,  Shirahige  K,  Aburatani  H  et  ah  Identification  of  novel  androgen 
response  genes  in  prostate  cancer  cells  by  coupling  chromatin 
immunoprecipitation  and  genomic  microarray  analysis.  Oncogene  2007. 

18.  Guo  QM,  Malek  RL,  Kim  S,  Chiao  C,  He  M,  Ruffy  M,  Sanka  K,  Lee  NH,  Dang 
C V,  Liu  ET:  Identification  of  c-myc  responsive  genes  using  rat  cDNA 
microarray.  Cancer  Res  2000,  60(21):5922-5928. 

19.  Kannan  K,  Kaminski  N,  Rechavi  G,  Jakob-Hirsch  J,  Amariglio  N,  Givol  D:  DNA 
microarray  analysis  of  genes  involved  in  p53  mediated  apoptosis:  activation 
of  Apaf-1.  Oncogene  2001,  20(26):3449-3455. 

20.  O'Connell  BC,  Cheung  AF,  Simkevich  CP,  Tam  W,  Ren  X,  Mateyak  MK,  Sedivy 
JM:  A  large  scale  genetic  analysis  of  c-Myc-regulated  gene  expression 
patterns.  J  Biol  Client  2003,  278(14):  12563- 12573. 

21.  Stanelle  J,  Stiewe  T,  Theseling  CC,  Peter  M,  Putzer  BM:  Gene  expression 
changes  in  response  to  E2F1  activation.  Nucleic  Acids  Res  2002,  30(8):  1859- 
1867. 

22.  Watson  JD,  Oster  SK,  Shago  M,  Khosravi  F,  Penn  LZ:  Identifying  genes 
regulated  in  a  Myc-dependent  manner.  J  Biol  Cliem  2002,  277(40) : 36921 - 
36930. 

23.  Oberley  MJ,  Farnham  PJ:  Probing  chromatin  immunoprecipitates  with  CpG- 
island  microarrays  to  identify  genomic  sites  occupied  by  DNA-binding 
proteins.  Methods  Enzymol  2003,  371:577-596. 

24.  Ren  B,  Dynlacht  BD:  Use  of  chromatin  immunoprecipitation  assays  in 
genome- wide  location  analysis  of  mammalian  transcription  factors.  Methods 
Enzymol  2004,  376:304-315. 

25.  Oberley  MJ,  Tsao  J,  Yau  P,  Farnham  PJ:  High-throughput  screening  of 
chromatin  immunoprecipitates  using  CpG-island  microarrays.  Methods 
Enzymol  2004,  376:315-334. 

26.  Paris  J,  Virtanen  C,  Lu  Z,  Takahashi  M:  Identification  of  MEF2-regulated 
genes  during  muscle  differentiation.  Physiol  Genomics  2004,  20(  1 ):  143-15 1. 

27.  Liu  CL,  Schreiber  SL,  Bernstein  BE:  Development  and  validation  of  a  T7  based 
linear  amplification  for  genomic  DNA.  BMC  Genomics  2003,  4(1):  19. 

28.  Shi  L,  Reid  LH,  Jones  WD,  Shippy  R,  Warrington  JA,  Baker  SC,  Collins  PJ,  de 
Longue ville  F,  Kawasaki  ES,  Lee  KY  et  ah  The  MicroArray  Quality  Control 


33 


SUBMITTED 


(MAQC)  project  shows  inter-  and  intraplatform  reproducibility  of  gene 
expression  measurements.  Nat  Biotechnol  2006,  24(9):  115 1-1 161. 

29.  Wei  C,  Li  J,  Bumgarner  RE:  Sample  size  for  detecting  differentially  expressed 
genes  in  microarray  experiments.  BMC  Genomics  2004,  5(1):87. 

30.  Heisler  LE,  Torti  D,  Boutros  PC,  Watson  J,  Chan  C,  Winegarden  N,  Takahashi 
M,  Yau  P,  Huang  TH,  Farnham  PJ  et  ah  CpG  Island  microarray  probe 
sequences  derived  from  a  physical  library  are  representative  of  CpG  Islands 
annotated  on  the  human  genome.  Nucleic  Acids  Res  2005,  33(9):2952-2961. 

31.  Birnie  GD:  The  HL60  cell  line:  a  model  system  for  studying  human  myeloid 
cell  differentiation.  Br  J  Cancer  Suppl  1988,  9:41-45. 

32.  Branham  WS,  Melvin  CD,  Han  T,  Desai  VG,  Moland  CL,  Scully  AT,  Fuscoe  JC: 
Elimination  of  laboratory  ozone  leads  to  a  dramatic  improvement  in  the 
reproducibility  of  microarray  gene  expression  measurements.  BMC 
Biotechnol  2007,  7:8. 

33.  Fare  TL,  Coffey  EM,  Dai  H,  He  YD,  Kessler  DA,  Kilian  KA,  Koch  JE,  LeProust 
E,  Marton  MJ,  Meyer  MR  et  ah  Effects  of  atmospheric  ozone  on  microarray 
data  quality.  Anal  Client  2003,  75(17):4672-4675. 

34.  Bachtiary  B,  Boutros  PC,  Pintilie  M,  Shi  W,  Bastianutto  C,  Li  JH,  Schwock  J, 
Zhang  W,  Penn  LZ,  Jurisica  I  et  ah  Gene  expression  profiling  in  cervical 
cancer:  an  exploration  of  intratumor  heterogeneity.  Clin  Cancer  Res  2006, 
12(19):5632-5640. 

35.  Vigano  MA,  Lamartine  J,  Testoni  B,  Merico  D,  Alotto  D,  Castagnoli  C,  Robert  A, 
Candi  E,  Melino  G,  Gidrol  X  et  ah.  New  p63  targets  in  keratinocytes  identified 
by  a  genome-wide  approach.  Embo  J  2006. 

36.  Myc  target  gene  database.  In. :  http://www.myc-cancer- 
gene.org/site/mycTargetDB.asp. 

37.  Zeller  KI,  Jegga  AG,  Aronow  BJ,  O'Donnell  KA,  Dang  C V:  An  integrated 
database  of  genes  responsive  to  the  Myc  oncogenic  transcription  factor: 
identification  of  direct  genomic  targets.  Genome  Biol  2003,  4(10):R69. 

38.  Krieg  AJ,  Hammond  EM,  Giaccia  AJ:  Functional  analysis  of  p53  binding 
under  differential  stresses.  Mol  Cell  Biol  2006,  26(19):7030-7045. 

39.  Semeralul  MO,  Boutros  PC,  Likhodi  O,  Okey  AB,  Van  Tol  HH,  Wong  AH: 
Microarray  analysis  of  the  developing  cortex.  J  Neurobiol  2006. 

40.  Dobbin  K,  Shih  JH,  Simon  R:  Statistical  design  of  reverse  dye  microarrays. 
Bioinformatics  2003,  19(7):803-8 10. 

41.  Dobbin  KK,  Kawasaki  ES,  Petersen  DW,  Simon  RM:  Characterizing  dye  bias 
in  microarray  experiments.  Bioinformatics  2005,  21(10):2430-2437. 

42.  He  YD,  Dai  H,  Schadt  EE,  Cavet  G,  Edwards  SW,  Stepaniants  SB,  Duenwald  S, 
Kleinhanz  R,  Jones  AR,  Shoemaker  DD  et  ah  Microarray  standard  data  set 
and  figures  of  merit  for  comparing  data  processing  methods  and  experiment 
designs.  Bioinformatics  2003,  19(8):956-965. 

43.  Dafforn  A,  Chen  P,  Deng  G,  Herrler  M,  Iglehart  D,  Koritala  S,  Lato  S,  Pillarisetty 
S,  Purohit  R,  Wang  M  et  ah  Linear  mRNA  amplification  from  as  little  as  5  ng 
total  RNA  for  global  gene  expression  analysis.  Biotechniques  2004,  37(5):854- 
857. 

44.  Marko  NF,  Frank  B,  Quackenbush  J,  Lee  NH:  A  robust  method  for  the 


34 


SUBMITTED 


amplification  of  RNA  in  the  sense  orientation.  BMC  Genomics  2005,  6(1):27. 

45.  Dang  CV,  van  Dam  H,  Buckmire  M,  Lee  WM:  DNA-binding  domain  of  human 
c-Myc  produced  in  Escherichia  coli.  Mol  Cell  Biol  1989,  9(6):2477-2486. 

46.  Pan  W,  Lin  J,  Le  CT:  How  many  replicates  of  arrays  are  required  to  detect 
gene  expression  changes  in  microarray  experiments?  A  mixture  model 
approach.  Genome  Biol  2002,  3(5):research0022. 

47.  Peixoto  BR,  Vencio  RZ,  Egidio  CM,  Mota-Vieira  L,  Verjovski- Almeida  S,  Reis 
EM:  Evaluation  of  reference-based  two-color  methods  for  measurement  of 
gene  expression  ratios  using  spotted  cDNA  microarrays.  BMC  Genomics 
2006,  7:35. 

48.  Shih  JH,  Michalowska  AM,  Dobbin  K,  Ye  Y,  Qiu  TH,  Green  JE:  Effects  of 
pooling  mRNA  in  microarray  class  comparisons.  Bioinformatics  2004, 

20(  1 8):33 1 8-3325. 

49.  Immunoprecipitated  DNA  Amplification  In.  Edited  by 
http://www.microarravs.ca/support/PDFs/ChIP  Protocol  for  Microarray  Analysi 

s  Beads  corr  01-12-06.pdf. 

50.  Gentleman  RC,  Carey  VJ,  Bates  DM,  Bolstad  B,  Dettling  M,  Dudoit  S,  Ellis  B, 
Gautier  L,  Ge  Y,  Gentry  J  et  ah  Bioconductor:  open  software  development  for 
computational  biology  and  bioinformatics.  Genome  Biol  2004,  5(10):R80. 

5 1 .  Smyth  GK:  Linear  models  and  empirical  bayes  methods  for  assessing 
differential  expression  in  microarray  experiments.  Stat  Appl  Genet  Mol  Biol 
2004,  3:Article3. 

52.  Stajich  JE,  Block  D,  Boulez  K,  Brenner  SE,  Chervitz  SA,  Dagdigian  C,  Fuellen 
G,  Gilbert  JG,  Korf  I,  Lapp  H  et  al:  The  Bioperl  toolkit:  Perl  modules  for  the 
life  sciences.  Genome  Res  2002,  12(  10):161 1-1618. 

53.  Tatusova  TA,  Madden  TL:  BLAST  2  Sequences,  a  new  tool  for  comparing 
protein  and  nucleotide  sequences.  FEMS  Microbiol  Lett  1999,  174(2):247-250. 

54.  Karolchik  D,  Baertsch  R,  Diekhans  M,  Furey  TS,  Hinrichs  A,  Lu  YT,  Roskin 
KM,  Schwartz  M,  Sugnet  CW,  Thomas  DJ  et  ah.  The  UCSC  Genome  Browser 
Database.  Nucleic  Acids  Res  2003,  31(  1):5 1-54. 

55.  Altschul  SF,  Madden  TL,  Schaffer  AA,  Zhang  J,  Zhang  Z,  Miller  W,  Lipman  DJ: 
Gapped  BLAST  and  PSI-BLAST:  a  new  generation  of  protein  database 
search  programs.  Nucleic  Acids  Res  1997,  25(17):3389-3402. 


35 


SUBMITTED 


Figure  Legends 


Figure  1:  Experimental  design  to  evaluate  antibody  effect,  array  batch,  reciprocal 

labeling  and  hybridization.  Schematic  of  the  experimental  design  in  which  six 
independent  biological  replicates  of  cross-linked  HL60  cells  were  subjected  to  ChIP  with 
either  commercial  or  home-made  antibodies  raised  against  the  N-terminal  262  residues  of 
the  Myc  oncoprotein.  Each  sample  was  hybridized  to  individual  arrays  with  its  matched 
No  Ab  control.  Dashed  arrows  indicate  biological  replicates  #2  to  #5,  not  included  in  the 
figure  for  simplicity.  Black  arrows  indicate  reciprocal  labeling.  Dotted  lines  indicate  the 
period  of  seven  days  in  which  the  48  arrays  were  hybridized. 


Figure  2:  Measuring  antibody  effect 

(A)  Comparison  of  ChIP-chip  results  using  a  commercial  and  home-made  polyclonal 
antibody  raised  against  the  same  portion  of  Myc  protein  showed  a  similar  linear  trend  and 
ordering  by  the  two  antibodies  (p  <  2.2  x  10  16;  Wilcoxon  Rank  Test),  however,  they 
differed  by  orders  of  magnitude  in  sensitivity  (axis  scales).  (B)  Purification  of  the  home¬ 
made  antibody  preparation  improved  sensitivity.  The  line  represents  equal  sensitivity  (y 
=  x).  Note  that  orders  of  magnitude  on  the  axes  differ  between  parts  A  and  B  because  the 
replicate  numbers  are  different  (A,  n  =  24;  B,  n  =  4). 


Figure  3:  Evaluating  array  batch  effects,  dye-bias,  and  inter-day  processing 

To  assess  the  effects  of  array  batch,  dye-swap,  and  hybridization  date  we  fitted  separate 
spot-wise  linear  models  to  the  data  from  our  study  for  each  of  these  three  parameters. 
Gaussian  kernel  densities  were  then  plotted  for  the  coefficients  of  the  fits  for  each  model. 
Each  curve  is  unimodal,  with  a  sharp  peak  around  the  origin,  indicating  that  most  spots 
on  the  array  do  not  vary  with  these  design  parameters.  The  spot  counts  along  the  bottom 
of  the  figure  further  quantify  the  degree  of  variability  at  several  thresholds  (0.5=  1.4-fold, 
1.0  =  2.0-fold,  1.5  =  2.8-fold,  2.  0=  4.0-fold). 


Figure  4:  Comparing  DNA  amplification  methods 

To  evaluate  whether  specific  DNA  amplification  protocols  introduce  different  degrees  of 
bias,  we  surveyed  the  genome-wide  effects  of  three  amplification  protocols.  A)  Total 
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genomic  DNA  was  amplified  with  either  random  priming  (RP-PCR)  or  one  of  two  LM- 
PCR  protocols  (LM-PCR-A  and  LM-PCR-B).  The  resulting  DNA  was  then  hybridized 
to  a  CpG  Island  microarray  against  an  equal  amount  of  unamplified  total  genomic  DNA. 
This  procedure  was  repeated  four  times  for  each  of  the  three  amplification  methods,  and 
the  raw  data  was  pre-processed  with  VSN  and  model-based  t-tests  (see  Methods).  Black 
arrows  indicate  reciprocal  labeling.  B)  The  fraction  of  spots  on  the  array  that  are 
differentially  amplified  (fraction  of  spots  called)  is  plotted  against  the  p-value  threshold 
for  each  amplification  method.  The  curve  for  the  first  LM-PCR  protocol  (LM-PCR  A) 
lies  below  that  of  the  other  two  methods  for  all  p-values,  demonstrating  that  this  method 
has  the  smallest  amplification  bias  of  the  three  protocols  tested. 


Figure  5:  Comparing  hybridization  controls 

A  number  of  potential  control  samples  can  be  used  for  ChIP-chip  studies.  (A)  Three 
common  approaches  were  compared,  including  two  direct  hybridization  approaches  (No 
Ab  and  IgG)  and  an  indirect  hybridization  approach  (No  Ab).  For  each  approach  ChIP- 
chip  studies  of  Myc  in  HL60  cells  were  performed.  Black  arrows  indicate  reciprocal 
labeling.  (B)  The  sensitivity  of  each  approach  was  plotted  as  the  percentage  of  spots 
identified  as  differentially  bound  vs.  the  p-value.  While  the  three  methods  displayed 
similar  performance,  the  two  direct  methods  show  higher  sensitivity  than  the  indirect 
method. 


Figure  6:  Myc  recruitment  to  regulatory  regions  of  novel  target  genes  in  growing 
HL60  cells. 

ChIP  assays  with  antibody  to  Myc  and  IgG  control  antibody  from  exponentially  growing 
HL60  cells.  DNA  was  Q-PCR  amplified  by  using  primers  against  a  positive  (Cyclin  D2) 
and  a  negative  control  (Chromosome  21  E-Box),  as  well  as  primers  against  five 
sequences  from  the  top  third  of  the  list  of  hits,  primers  against  four  sequences  from  the 
middle  third  of  the  list  of  hits,  and  primers  against  four  selected  from  the  bottom  third  of 
the  list  of  hits.  Array  Rank  was  determined  by  p-values.  GDAP2 :  Ganglioside  Induced 
Differentiation  Associated  Protein  2.  RPL34\  Ribosomal  Protein  34.  PHLPP\  PH  Domain 
and  Leucine  Rich  Repeat  Protein  Phosphatase.  TCERG1 :  Transcription  Elongation 
Regulator  1.  PKIB :  Protein  Kinase  (cAMP-dependent,  catalytic)  Inhibitor  Beta.  CDC14A: 
Cell  Division  Cycle  14A.  OGGI:  8-oxoguanine  DNA  Glycosylase.  CDCA2 :  Cell 
Division  Cycle  Associated  2.  NOL6 :  Nucleolar  Protein  Family  6  (RNA-associated). 
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PLTP\  Phospholipid  Transfer  Protein.  TRH:  Thyrotropin- Re  leasing  Hormone.  KLC4 : 
Kinesin  Light  Chain  4.  GASP.  Growth  Arrest-Specific  1. 

(1)  and  (2)  indicates  two  primer  pairs  against  different  regions  of  the  designated  target 
gene  promoter. 

The  mean  log2IMyc/IgGI  values  are  plotted  for  each  target  gene,  and  the  Chr21  E-box. 
Error  bars  represent  standard-deviations.  Targets  with  p  <  0.05  are  marked  with  a  *, 
statistical  significance  as  assessed  using  a  paired  t  test. 


Please  be  aware  that  3  additional  data  files  will  be  included  with  the  manuscript: 
Supplementary  tables  1,  2  and  3 
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chrl  9:50620370-50620427 

42 

1353 

Downstream 

UHNhscpg0007470 

0.00237 

0.527 

2 

chrl  :321 68088-321 68241 

74 

8334 

Upstream 

UHNhscpg0002483 

0.00243 

0.296 

3 

chrl  6:851 45935-851 46230 

280 

112 

UHNhscpgOOOl  574 

0.00256 

1 

1 

chr6:1 44458002-1 4445871 8 

713 

0 

Overlapping 

UHNhscpgOOOl  709 

0.00257 

0.922 

2 

chr3:44664986-44665759 

770 

0 

UHNhscpg0006569 

0.00259 

1 

1 

chrl  1:1 11 254731 -11 1255533 

803 

0 

Overlapping 

UHNhscpgOOOl  158 

0.00265 

1 

1 

chrl  7:39935622-39936669 

1024 

0 

UHNhscpg0002640 

0.0027 

1 

1 

chr2:85501 234-85502047 

810 

10047 

Downstream 

UHNhscpg0000920 

0.00282 

1 

1 

chrl  1:621 15235-621 15803 

565 

0 

UHNhscpg0006739 

0.00282 

0.236 

2 

chrl  :3653933-3654076 

144 

883 

Downstream 

UHNhscpgOOOl  81 4 

0.00288 

0.268 

13 

chr9:44341 81 6-44342357 

471 

1217010 

jBMlJ.MiraillMg 

UHNhscpg0002399 

0.0029 

1 

1 

chr5:1 40730098-1 40730485 

384 

0 

Overlapping 

UHNhscpg0002080 

0.00299 

0.187 

2 

chrl  9:1 37461 70-1 3746588 

395 

0 

UHNhscpgOOOl  890 

0.00302 

0.869 

2 

chrl  2:51 975674-51 976234 

534 

173 

Upstream 

UHNhscpg0006364 

0.00302 

1 

1 

chrl  9:9740462-9740746 

285 

52 

Downstream 

UHNhscpg0003776 

0.00305 

1 

1 

chrX:38545425-38545774 

350 

2242 

Downstream 

UHNhscpg0000897 

0.00317 

0 

66 

NA 

NA 

UHNhscpg0009677 

0.00318 

0.871 

2 

chrl  7:72065094-72065566 

457 

17328 

Upstream 

UHNhscpg00061 24 

0.00332 

0.271 

2 

chr9:95748240-95748309 

70 

9114 

UHNhscpg0000478 

0.00335 

0.797 

2 

chr2:1 1301 5948-1 1301 6337 

390 

0 

UHNhscpgOOOl  754 

0.00347 

0.897 

2 

chrl  7:781 551 08-781 5571 5 

400 

43851 

UB.lJil-iU-l.luW 

UHNhscpg0002457 

0.0035 

0.752 

3 

chr5:1 25958442-1 25959093 

636 

0 

UHNhscpg0008746 

0.00353 

1 

1 

chrl  :1 64064064-1 64064435 

225 

551 

nmMMim 

UHNhscpgOOOl  954 

0.0036 

0 

12 

NA 

NA 

UHNhscpg0002604 

0.0036 

0.85 

3 

chrl  9:32976290-32976943 

439 

1419011 

UHNhscpg0004396 

0.00369 

0.935 

2 

chrl  8:550901 06-55091 038 

922 

567 

■BMW 

UHNhscpg0008244 

0.00375 

0.897 

2 

chr4:841 75025-841 75531 

507 

0 

UHNhscpgOOOl  908 

0.00377 

1 

1 

chrl  2:1 1 9239392-1 1 9240071 

665 

9904 

■SsSRjSI 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpg0002101 

0.0038 

0.694 

2 

chr9:351 06261 -351 06931 

647 

368 

Downstream 

UHNhscpg0002273 

0.0038 

1 

1 

chrl  2:1 02489049-1 02489597 

537 

15601 

Downstream 

UHNhscpgOOl  1595 

0.00385 

0 

35 

NA 

NA 

UHNhscpgOOOl  182 

0.0039 

0 

13 

NA 

NA 

UHNhscpg0009185 

0.0039 

0 

5 

chrl  :1 48070730-1 48071 003 

266 

0 

Overlapping 

UHNhscpg0005267 

0.00406 

1 

1 

chrl  9:1 2695458-1 2696086 

629 

1380 

Downstream 

UHNhscpg00071 92 

0.00413 

1 

1 

chr6:24829288-24829774 

463 

1906 

Downstream 

UHNhscpgOOl  0640 

0.00414 

1 

1 

chrl  6:81 828084-81 828454 

367 

570639 

Downstream 

UHNhscpg0000808 

0.00424 

1 

1 

chr7:451 17391-451 18238 

844 

0 

Overlapping 

UHNhscpgOOOl  31 5 

0.00435 

1 

1 

chr2:207338359-207338723 

353 

64 

Downstream 

UHNhscpg0006036 

0.00435 

1 

1 

chr22:4034741 2-40347900 

489 

172 

Upstream 

UHNhscpg0006692 

0.0044 

0 

39 

NA 

NA 

UHNhscpg0000663 

0.00441 

0 

4 

chr7:1 43029357-1 43029674 

94 

691 

Upstream 

UHNhscpg0008592 

0.00454 

1 

1 

chr6:1 22972820-1 22973031 

132 

44 

Downstream 

UHNhscpg000271 8 

0.00459 

1 

1 

chr4:53926306-5392701 8 

701 

0 

Overlapping 

UHNhscpgOOOl  461 

0.00466 

1 

1 

chrl  1 :1 24241 472-12424201 0 

527 

981 

IKBUnflS 

UHNhscpgOOl  1747 

0.00466 

1 

1 

chrl  3:46268564-46268875 

308 

493 

Upstream 

UHNhscpg000401 5 

0.00489 

1 

1 

chr2:1 07581 563-1 07581 793 

219 

387633 

Downstream 

UHNhscpg00021 58 

0.00491 

0.762 

2 

chr2:1 27999967-1 28000281 

311 

233 

Upstream 

UHNhscpg0004300 

0.00491 

1 

1 

chrl  3:3576991 7-35770383 

467 

48070 

HBE 

UHNhscpg0004008 

0.00495 

1 

1 

chrl  2:1 1 95031 84-1 1 950401 3 

810 

0 

Overlapping 

UHNhscpg0008423 

0.00501 

0.883 

2 

chr6:1 1 6998749-1 1 6999278 

530 

0 

UHNhscpg0008176 

0.00502 

0.458 

3 

chr3:472991 03-4729961 5 

513 

0 

Overlapping 

UHNhscpg0009268 

0.00504 

0.909 

2 

chr2:7381 7431 -7381 8211 

769 

0 

UHNhscpg0004645 

0.00508 

1 

1 

chr6:43089334-430901 21 

758 

0 

Overlapping 

UHNhscpg0002437 

0.00514 

0.624 

8 

chrl  9:63022721  -63023265 

521 

4628 

Downstream 

UHNhscpg0000389 

0.00519 

1 

1 

chr9:94095629-94096224 

572 

0 

Overlapping 

UHNhscpg0005425 

0.00523 

1 

1 

chr22:36533363-36533799 

429 

101 

Downstream 

UHNhscpg0002339 

0.00526 

0.908 

2 

chrl  5:61 236270-61 236994 

665 

0 

Overlapping 

UHNhscpg0004394 

0.00533 

0.357 

4 

chrl  9:57334937-573351 21 

185 

0 

UHNhscpg0003606 

0.00537 

1 

1 

chr8:96350447-96351 060 

614 

0 

Overlapping 

UHNhscpg0009520 

0.0054 

1 

1 

chrl  :91 738839-91 739081 

243 

0 

UHNhscpg0008070 

0.00549 

1 

1 

chrl  :1 91 4901 12-191 490214 

99 

67765 

Downstream 

UHNhscpgOOOl  041 

0.00566 

1 

1 

chrl  :1 54808695-1 548091 77 

479 

0 

UHNhscpg0009274 

0.0057 

0 

9 

NA 

NA 

UHNhscpg0005239 

0.00582 

1 

1 

chrl  9:635901 1 4-63590420 

264 

27 

Downstream 

UHNhscpg0005459 

0.00584 

1 

1 

chr6:1 53345899-1 5334671 7 

784 

32 

Downstream 

UHNhscpg0004074 

0.00593 

0 

2 

NA 

NA 

UHNhscpg0006096 

0.00594 

0.621 

3 

chrl  2:948601 44-94860524 

282 

35 

Upstream 

UHNhscpg000561 4 

0.00599 

0.549 

2 

chrl  0:32256976-322571 08 

133 

668 

UHNhscpgOOOl  279 

0.00607 

1 

1 

chr2:1 564901 4-1 5649865 

808 

0 

Overlapping 

UHNhscpg0002006 

0.00611 

1 

1 

chrl  :68288598-682891 25 

452 

0 

UHNhscpgOOOl  133 

0.00616 

1 

1 

chrl  :1 54808695-1 548091 77 

479 

0 

Overlapping 

UHNhscpg0008635 

0.00617 

0.339 

2 

chr2:45008927-45008985 

59 

13555 

Downstream 

UHNhscpg0002795 

0.0064 

1 

1 

chrl  9:50601 483-506021 03 

349 

177 

Upstream 

UHNhscpgOOl  0736 

0.0064 

1 

1 

chrl  6:81 828084-81 828447 

360 

570646 

Downstream 

UHNhscpg000031 4 

0.00645 

0.655 

5 

chrl  1 :85633827-85634078 

252 

365 

Upstream 

UHNhscpgOOOl  205 

0.00655 

0.507 

2 

chrl  7:38793349-38793666 

302 

38212 

Downstream 

UHNhscpgOOOl  143 

0.00659 

0 

43 

NA 

NA 

UHNhscpgOOOl  237 

0.00664 

1 

1 

chrl  5:67673499-67674364 

842 

141287 

UHNhscpg0000480 

0.00665 

1 

1 

chr7:1 3531 2099-1 3531 291 5 

727 

0 

UHNhscpg0004701 

0.00682 

0.757 

2 

chr4:3046987-3047 1 67 

181 

782 

UHNhscpgOOOl  967 

0.00689 

1 

1 

chrl  :1 00589367-1 0058971 5 

345 

919 

Downstream 

UHNhscpg000091 5 

0.00691 

0.106 

2 

chrl  9:5061 9268-5061 931 4 

47 

251 

Downstream 

UHNhscpg0005276 

0.00704 

0.74 

2 

chr6:30646907-30647379 

473 

0 

UHNhscpgOOOl  848 

0.00705 

0 

34 

NA 

NA 

UHNhscpgOOOl  795 

0.00714 

1 

1 

chr22:45536661  -455371 1 5 

431 

97 

Downstream 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpg0008428 

0.00717 

0 

2 

NA 

NA 

UHNhscpg0004303 

0.00718 

0.563 

33 

chr6:27883373-27883888 

508 

0 

UHNhscpg0001809 

0.00728 

0 

15 

NA 

NA 

UHNhscpgOOOOl  19 

0.00737 

0.41 

13 

chrl  7:25951 567-25951 939 

373 

40858 

UHNhscpg0009385 

0.00744 

0.572 

2 

chr3:9765933-9766477 

545 

176 

Downstream 

UHNhscpg0000272 

0.00749 

0.457 

4 

chr5:1 31 853530-1 31 853938 

405 

388 

S9!m 

UHNhscpg0000705 

0.00772 

0.486 

2 

chrl  4:51 396904-51 3971 51 

210 

16908 

Downstream 

UHNhscpg0000925 

0.00779 

1 

1 

chr20:1 1257476-1 1257777 

235 

561699 

Downstream 

UHNhscpg0000449 

0.00785 

0 

13 

NA 

NA 

UHNhscpg0002569 

0.00786 

0.761 

4 

chrl  9:49337662-49337941 

268 

45 

imsK 

UHNhscpg0000556 

0.00787 

0.834 

2 

chr5:89805381 -898061 66 

763 

175 

Upstream 

UHNhscpgOOOl  206 

0.0079 

0 

29 

chrl  :1 6865365-1 6865789 

377 

52796 

Downstream 

UHNhscpg0008723 

0.00807 

0 

13 

NA 

NA 

UHNhscpg0008688 

0.00814 

0.62 

3 

chrl  :23761 6558-23761 6870 

305 

242125 

Downstream 

UHNhscpgOOOl  530 

0.00823 

1 

1 

chr7:1 3995333-1 3995759 

251 

44 

Downstream 

UHNhscpg0003878 

0.00823 

1 

1 

chr2:238539828-238540325 

474 

113 

Downstream 

UHNhscpgOOOl  947 

0.00824 

0.784 

2 

chrl  6:1 4073060-1 4073523 

444 

364 

Upstream 

UHNhscpg0009557 

0.00831 

0.711 

2 

chr6:961 31 747-961 32044 

294 

89 

Downstream 

UHNhscpg0009765 

0.00834 

0 

63 

NA 

NA 

UHNhscpgOOOl  955 

0.00848 

1 

1 

chrl  :21 0274778-21 02761 1 0 

1234 

0 

UHNhscpg0002254 

0.00853 

0.681 

2 

chr9:7789295-7789855 

524 

0 

Overlapping 

UHNhscpgOOl  1787 

0.00857 

1 

1 

chr6:1 06548952-1 06549065 

114 

91822 

Downstream 

UHNhscpg0000048 

0.00864 

0.614 

3 

chrl  8:31 177840-31 178483 

621 

0 

Overlapping 

UHNhscpg0002200 

0.00864 

1 

1 

chrl  4:76993362-769941 31 

758 

0 

UHNhscpg0000491 

0.00873 

0.965 

2 

chr6:1 59340256-1 59341 432 

1121 

0 

Overlapping 

UHNhscpg0002926 

0.00886 

0 

43 

NA 

NA 

UHNhscpg0000927 

0.00887 

1 

1 

chrl  7:701 57894-701 58326 

425 

20538 

Downstream 

UHNhscpgOOOl  162 

0.0091 

0 

23 

NA 

NA 

UHNhscpg0005505 

0.00912 

1 

1 

chrl  6:31 1 341 23-31 135151 

994 

745 

Upstream 

UHNhscpg0005734 

0.00917 

0 

35 

NA 

NA 

UHNhscpg0000278 

0.00923 

1 

1 

chr3:4864731 3-48648039 

727 

0 

Overlapping 

UHNhscpg0002243 

0.00929 

0.912 

2 

chrl  5:61 236270-61 236994 

697 

0 

UHNhscpg0002446 

0.0093 

0 

13 

NA 

NA 

UHNhscpgOOl  1406 

0.0093 

1 

1 

chrl  1:590751 09-590751 58 

50 

36148 

UHNhscpgOOl  0874 

0.00933 

1 

1 

chrl  3:46268564-46268875 

308 

493 

Upstream 

UHNhscpg0000286 

0.00943 

1 

1 

chr6:30748320-30748935 

529 

0 

UHNhscpg0007078 

0.00949 

1 

1 

chr5:1 15317061-1 15317139 

79 

9030 

Downstream 

UHNhscpg0006940 

0.0095 

1 

1 

chrl  9:9295687-9295872 

170 

55 

Downstream 

UHNhscpgOOl  1843 

0.00966 

1 

1 

chrl  3:46268564-46268875 

308 

493 

Upstream 

UHNhscpgOOOl  113 

0.00993 

0.476 

2 

chr7:2078371 4-20784421 

656 

8609 

awMiraiiil 

UHNhscpgOOOl  527 

0.00997 

1 

1 

chr20:89961 80-8996731 

541 

28200 

Downstream 

UHNhscpgOOOOl  68 

0.01009 

0.085 

2 

chrl  3:99431 681 -99431 833 

153 

486 

Downstream 

UHNhscpgOOOl  933 

0.01012 

0.263 

2 

chrl  5:80342224-8034271 1 

380 

18 

Upstream 

UHNhscpg0000809 

0.01015 

0.823 

5 

chrl  9:6261 4206-6261 4920 

711 

0 

UHNhscpg0000882 

0.01027 

0 

17 

NA 

NA 

UHNhscpg0009269 

0.01029 

1 

1 

chr2:21 8940536-21 8941 1 48 

597 

10714 

SiHliB 

UHNhscpgOOOl  322 

0.01035 

0 

2 

chr5:69381 228-69381 607 

376 

123 

Upstream 

UHNhscpg0000372 

0.01038 

1 

1 

chr4:76868579-76869553 

939 

0 

UHNhscpg00091 93 

0.01047 

1 

1 

chrl  0:89567438-89567728 

239 

169 

UHNhscpgOOOl  154 

0.01053 

1 

1 

chrl  7:1 7080205-1 7080795 

587 

426 

UHNhscpgOOOl  888 

0.01055 

1 

1 

chr22:45536661  -455371 1 5 

431 

97 

Downstream 

UHNhscpgOOOl  569 

0.01057 

1 

1 

chr6:1 38230630-1 38231 068 

427 

357 

■!BSH 

UHNhscpgOOOl  578 

0.01076 

1 

1 

chrl  8:73075889-73076383 

483 

14337 

Downstream 

UHNhscpgOOOl  506 

0.01078 

0.607 

2 

chrl  3:26723246-26723668 

415 

23 

Downstream 

UHNhscpg0000937 

0.01094 

0 

19 

NA 

NA 

UHNhscpg0007203 

0.011 

0.746 

2 

chrl  :3531 7368-3531 7832 

456 

0 

UHNhscpg0000658 

0.01118 

0.865 

2 

chrl  0:79458886-79459376 

436 

0 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpg00031 55 

0.01122 

0.033 

35 

chr4:49209349-49209607 

215 

526328 

Upstream 

UHNhscpg0006329 

0.01128 

0.453 

9 

chrl  :67668270-67668930 

653 

0 

UHNhscpg0008734 

0.01146 

1 

1 

chrl  0:1 22729028-1 2272931 1 

280 

128344 

Upstream 

UHNhscpg000721 2 

0.01183 

0.858 

2 

chrl  7:34983441  -34984081 

641 

33620 

UHNhscpg0002673 

0.01195 

1 

1 

chrl  5:81 8251 25-81 825456 

235 

80653 

Downstream 

UHNhscpgOOOl  300 

0.012 

0 

2 

chr5:69381 228-69381 607 

372 

123 

UHNhscpg0001291 

0.01203 

1 

1 

chrl  1:621 15235-621 15799 

479 

0 

Overlapping 

UHNhscpg0000596 

0.0122 

0.872 

3 

chrl  1:9601 38-961 029 

797 

44298 

UHNhscpgOOl  1233 

0.0123 

0 

47 

NA 

NA 

UHNhscpgOOOl  228 

0.01239 

1 

1 

chrl  :1 54808695-1 548091 77 

479 

0 

UHNhscpgOOl  1824 

0.01243 

0.847 

2 

chrl  6:21 44955-21 451 83 

229 

616 

Downstream 

UHNhscpg0005883 

0.01259 

1 

1 

chrl  :1 0381 074-1 0382309 

1229 

0 

UHNhscpg0005465 

0.01273 

1 

1 

chr6:71 049050-71 04971 9 

670 

0 

Overlapping 

UHNhscpg0000441 

0.01291 

0.567 

2 

chrl  3:40604530-4060521 1 

670 

0 

UHNhscpg0002181 

0.01293 

0.494 

2 

chr8:1 32985502-1 32985928 

427 

35401 

Downstream 

UHNhscpg0004594 

0.01294 

1 

1 

chrl  :1 607341 1 1  -1 60734402 

255 

0 

UHNhscpg0002608 

0.01299 

0.925 

3 

chr9:1 1 6389649-1 1 6390322 

666 

0 

Overlapping 

UHNhscpgOOOl  075 

0.01308 

1 

1 

chr6:27594840-27595208 

357 

45982 

Downstream 

UHNhscpg000761 9 

0.01313 

1 

1 

chrl  6:841 44824-841 45260 

433 

59164 

Downstream 

UHNhscpg0002828 

0.01322 

0.183 

2 

chr22:25086487-25086628 

142 

68673 

Downstream 

UHNhscpg0004797 

0.01325 

1 

1 

chr4:8572241 3-85722792 

380 

288 

Downstream 

UHNhscpg0009903 

0.01325 

0.851 

3 

chrl  1  77026049-77026697 

649 

0 

UHNhscpgOOOl  122 

0.01327 

0.231 

2 

chr4:1 531 86392-1 531 8661 2 

221 

284796 

Downstream 

UHNhscpgOOOl  657 

0.01331 

1 

1 

chr3:1821 13380-1821 13899 

504 

235 

UHNhscpgOOOl  302 

0.01335 

0.199 

4 

chrl  2:47531 849-4753221 8 

362 

6 

Upstream 

UHNhscpgOOOl  222 

0.01337 

0.485 

6 

chrl  :1 474231 76-1 47423307 

132 

18539 

Downstream 

UHNhscpgOOOl  628 

0.01339 

0.021 

10 

chr6:58249504-58249834 

331 

145821 

Upstream 

UHNhscpgOOOl  391 

0.01345 

0 

29 

chrl  :1 6865365-1 6865789 

377 

52796 

Downstream 

UHNhscpgOOOl  120 

0.01349 

0.526 

2 

chr3:1 1 6859797-1 1 6860278 

458 

34656 

Upstream 

UHNhscpgOOOOl  73 

0.01379 

1 

1 

chr4:1 00703869-1 00704376 

504 

21 

Downstream 

UHNhscpg0007087 

0.01409 

1 

1 

chrl  9:51 983258-51 983854 

597 

0 

Overlapping 

UHNhscpg0000825 

0.0143 

1 

1 

chrl  1:621 15235-621 15799 

561 

0 

UHNhscpg0007223 

0.0147 

0.476 

3 

chrl  1:351 171 68-351 17692 

521 

176 

Upstream 

UHNhscpgOOl  0071 

0.01472 

1 

1 

chrl  0:91 9601 46-91 960343 

173 

508800 

UHNhscpg000391 9 

0.01477 

1 

1 

chr2:1 07581 552-1 07581 794 

243 

387632 

Downstream 

UHNhscpgOOl  1489 

0.01482 

0.414 

83 

chrl  9:20995209-20995601 

389 

0 

UHNhscpg0002088 

0.01512 

0.131 

11 

chr2:71 21 0238-71 21 0899 

650 

0 

Overlapping 

UHNhscpg0003021 

0.0156 

1 

1 

chr5:56242005-56242320 

196 

1146 

■91BH 

UHNhscpgOOOl  776 

0.01569 

0.607 

2 

chrl  3:26723246-26723668 

415 

23 

Downstream 

UHNhscpg0004258 

0.01584 

1 

1 

chr8:1 2881 5427-1 2881 5697 

271 

1800 

Downstream 

UHNhscpg0002269 

0.01591 

1 

1 

chrl  6:1 871 9891 -18721 065 

1151 

0 

Overlapping 

UHNhscpg0008332 

0.01603 

0 

2 

NA 

NA 

UHNhscpg0002738 

0.01607 

0.183 

2 

chr22:25086487-25086628 

142 

68673 

Downstream 

UHNhscpg0005886 

0.01634 

0 

106 

NA 

NA 

UHNhscpg0006144 

0.01637 

1 

1 

chrl  2:551 48527-551 48849 

323 

0 

Overlapping 

UHNhscpg0000953 

0.01638 

0.193 

11 

chr2:1 321 47266-1 321 48290 

1001 

47109 

mm 

UHNhscpg0006819 

0.01664 

1 

1 

chrl  :81 71 1 875-8171 1915 

41 

326754 

Downstream 

UHNhscpg000351 3 

0.01669 

1 

1 

chr8:96350435-96351 060 

626 

0 

UHNhscpg0004688 

0.01682 

1 

1 

chr8:1 07738736-1 07738968 

233 

301 

Downstream 

UHNhscpgOOOl  41 9 

0.01688 

1 

1 

chrl  9:53990095-53990948 

819 

15165 

UHNhscpg0004354 

0.01698 

1 

1 

chr8:1 2881 5427-1 2881 5694 

256 

1803 

Downstream 

UHNhscpgOOOl  974 

0.01712 

0.907 

11 

chrl  9:60589086-60589859 

755 

0 

UHNhscpgOOOl  103 

0.01726 

1 

1 

chrl  9:1 151 0292-1 151 0795 

409 

0 

UHNhscpg00041 92 

0.01728 

1 

1 

chr3:25799228-2579941 3 

186 

513 

■sum 

UHNhscpg0000332 

0.01766 

0.007 

3 

chr3:1 97850403-1 97850877 

446 

175 

Downstream 

UHNhscpgOOOl  298 

0.01777 

0 

29 

chrl  :1 6865365-1 6865789 

377 

52796 

Downstream 

UHNhscpg0002046 

0.01788 

1 

1 

chrl  :21 0274778-21 02761 1 0 

1289 

0 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpg0000402 

0.01803 

0.807 

3 

chrl  0:70330982-70331 591 

602 

0 

Overlapping 

UHNhscpg0006392 

0.01804 

1 

1 

chr5:1 0302703-1 0303007 

301 

0 

UHNhscpg0006618 

0.01824 

1 

1 

chr9:4974346-4974895 

550 

349 

Downstream 

UHNhscpg000371 7 

0.01829 

1 

1 

chr2:1 87058898-1 87059935 

950 

0 

UHNhscpgOOOl  256 

0.01845 

0 

23 

NA 

NA 

UHNhscpg0002247 

0.01848 

0.915 

2 

chrl  6:3079351 1-307941 78 

644 

18722 

■9BB 

UHNhscpgOOOl  008 

0.01882 

0 

11 

NA 

NA 

UHNhscpg0003872 

0.01901 

1 

1 

chrX:38545421  -38545774 

350 

2242 

Downstream 

UHNhscpg00081 92 

0.01905 

1 

1 

chrl  0:1 03532635-1 0353321 5 

581 

0 

Overlapping 

UHNhscpg0000531 

0.01946 

1 

1 

chr4:1 861 71 924-1 861 731 47 

1181 

3929 

Downstream 

UHNhscpg0000072 

0.01947 

0 

23 

NA 

NA 

UHNhscpg0000468 

0.01948 

0.007 

23 

chrl  :1 4599901 7-1 45999561 

541 

66522 

UHNhscpg000081 3 

0.01957 

0.595 

3 

chrl  6:287431 36-2874341 6 

274 

1222 

Upstream 

UHNhscpgOOOl  492 

0.0196 

0 

12 

NA 

NA 

UHNhscpg0000220 

0.01962 

0.728 

2 

chr2:1 88864982-1 8886571 9 

738 

0 

Overlapping 

UHNhscpg0002563 

0.01971 

0.058 

5 

chrl  7:1 890781 5-1 8907957 

139 

16754 

Downstream 

UHNhscpg0010709 

0.01981 

1 

1 

chr5:31 567438-31 567801 

364 

124 

Upstream 

UHNhscpg0005462 

0.02007 

0.947 

2 

chrl  4:1 04706380-1 047071 44 

749 

174 

Downstream 

UHNhscpg00021 45 

0.02014 

0.461 

2 

chrl  :1 1 6762694-1 1 6763009 

308 

45336 

Upstream 

UHNhscpg0007737 

0.02019 

1 

1 

chrl  1:1 11 254752-1 11 255541 

653 

0 

UHNhscpgOOOl  058 

0.02026 

1 

1 

chrl  9:42021 242-42021 797 

544 

121 

Downstream 

UHNhscpgOOl  0828 

0.02046 

1 

1 

chr2:237377856-237377932 

55 

234738 

BwairaiiiM 

UHNhscpg0002025 

0.0206 

1 

1 

chrl  :47683502-47683914 

393 

9227 

Upstream 

UHNhscpg0000308 

0.02079 

0.829 

2 

chr6:1 21 697252-1 21 697969 

718 

0 

UHNhscpg0005353 

0.02088 

1 

1 

chrl  3:72253948-72254560 

549 

0 

Overlapping 

UHNhscpgOOOl  906 

0.02096 

0.749 

119 

chrl  5:6032401 3-60324682 

650 

3604 

■sm 

UHNhscpg0002545 

0.02111 

0 

11 

NA 

NA 

UHNhscpg0009036 

0.02111 

1 

1 

chr8:86281 84-8628292 

95 

31109 

UHNhscpgOOOl  636 

0.02115 

0.318 

2 

chr9:1 7032661 -17032855 

195 

92182 

Downstream 

UHNhscpgOOOl  536 

0.0213 

0 

43 

NA 

NA 

UHNhscpgOOOl  653 

0.0215 

0.493 

88 

chrl  9:23661 339-23661 91 4 

537 

0 

Overlapping 

UHNhscpg0008841 

0.0215 

0.816 

2 

chrl  9:607201 45-60720879 

724 

965 

Downstream 

UHNhscpg0007225 

0.02167 

1 

1 

chr8:25372386-253731 82 

797 

0 

Overlapping 

UHNhscpg0002003 

0.02185 

0.776 

2 

chr8:98724881  -98725277 

393 

305 

Downstream 

UHNhscpg0008297 

0.02192 

1 

1 

chrl  4:8991 9066-899201 56 

1069 

12969 

Downstream 

UHNhscpg0009796 

0.02195 

1 

1 

chr4:1 84000942-1 84001 1 80 

239 

74444 

UHNhscpg0004468 

0.02197 

0.005 

4 

chrl  7:41 794271 -41 795059 

773 

0 

Overlapping 

UHNhscpg0000824 

0.02201 

0.033 

2 

chrl  1:6541 2243-6541 2610 

273 

0 

UHNhscpg0002458 

0.02224 

0.522 

3 

chrl  2:1 1 1 860306-1 1 1 860954 

602 

0 

Overlapping 

UHNhscpg0004793 

0.02228 

1 

1 

chr3:1 3896921 6-1 38969571 

356 

2948 

UHNhscpg0002453 

0.02271 

1 

1 

chr2:1 37238469-1 3723871 0 

122 

646274 

Downstream 

UHNhscpgOOOl  067 

0.02292 

0 

23 

NA 

NA 

UHNhscpg0006728 

0.023 

1 

1 

chr6:30247427-30248400 

970 

7987 

Upstream 

UHNhscpg0006752 

0.02303 

1 

1 

chr14:3897181 1-38972105 

270 

440 

Downstream 

UHNhscpg0006451 

0.02308 

0.532 

61 

chr7 :6936836-6937722 

862 

1 04450 

Downstream 

UHNhscpg0007253 

0.02309 

0.004 

327 

chrl  :1 44034037-1 4403501 4 

955 

89620 

Downstream 

UHNhscpg0002847 

0.02319 

0.94 

2 

chr8:224751 90-22475966 

765 

3193 

Downstream 

UHNhscpg0000526 

0.02346 

0.531 

27 

chrl  :1 47490240-1 47490606 

303 

64371 

Downstream 

UHNhscpg0002450 

0.02366 

0 

11 

NA 

NA 

UHNhscpg000261 2 

0.02371 

0 

8 

chrl  :1 333261 7-1 3332708 

92 

35132 

Downstream 

UHNhscpg0002381 

0.02405 

0.045 

23 

chr4:90731 18-9073165 

44 

17761 

UHNhscpg0009176 

0.02406 

0.751 

10 

chrl  9:631 37964-631 38541 

570 

11 

UHNhscpg000201 0 

0.02434 

0.689 

2 

chr9:351 06261 -351 06931 

637 

368 

Downstream 

UHNhscpg000271 1 

0.02444 

0 

13 

NA 

NA 

UHNhscpg0003020 

0.02447 

0.642 

6 

chr7:1 4841 8256-14841 9067 

802 

0 

UHNhscpg0009776 

0.0245 

0 

64 

NA 

NA 

UHNhscpgOOOl  278 

0.0248 

1 

1 

chrl  8:5302801 1  -53028577 

563 

142141 

Downstream 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpgOOOl  348 

0.02512 

0 

23 

NA 

NA 

UHNhscpg0007695 

0.02514 

0.958 

2 

chrl  9:1 4500799-1 45021 79 

1373 

0 

UHNhscpg0000406 

0.02518 

1 

1 

chr8:71 109380-71 109747 

364 

36369 

Upstream 

UHNhscpgOOl  1125 

0.02525 

1 

1 

chr7:1 51 2031 04-1 51 203495 

384 

1754 

UHNhscpg0000902 

0.02532 

1 

1 

chr7:451 17391 -451 18238 

844 

0 

Overlapping 

UHNhscpg0002658 

0.02533 

0.704 

69 

chrl  9:21 48021 7-21 480792 

480 

0 

UHNhscpgOOOl  21 6 

0.0254 

0 

42 

NA 

NA 

UHNhscpg000091 4 

0.02571 

0 

11 

NA 

NA 

UHNhscpg000861 5 

0.02583 

0.03 

5 

chrl  :40009081 -40009875 

795 

0 

Overlapping 

UHNhscpg0008287 

0.02595 

1 

1 

chr9:33463227-3346421 2 

958 

0 

UHNhscpg0005841 

0.02605 

1 

1 

chrl  1:331 39663-331 3991 7 

255 

50 

Downstream 

UHNhscpg0002429 

0.02617 

0 

13 

NA 

NA 

UHNhscpgOOOl  688 

0.02627 

1 

1 

chrl  7:67859932-67860337 

406 

231177 

Upstream 

UHNhscpgOOOl  468 

0.0263 

0 

10 

NA 

NA 

UHNhscpg0003078 

0.02633 

0.729 

2 

chrl  5:301 09990-301 10251 

262 

0 

Overlapping 

UHNhscpg0000097 

0.02645 

0.519 

2 

chrl  7:341 56724-341 57336 

609 

748 

UHNhscpg0005079 

0.02653 

0.86 

2 

chrl  :1 52229727-1 52230020 

257 

0 

Overlapping 

UHNhscpg0007545 

0.02667 

1 

1 

chrl  1:1 8699693-1 8700388 

668 

3965 

UHNhscpg0000099 

0.0268 

0 

7 

chr7 :74909483-74909824 

338 

31902 

Upstream 

UHNhscpg0002803 

0.02687 

0 

13 

NA 

NA 

UHNhscpg0002031 

0.02689 

1 

1 

chrl  1:60566343-6056651 6 

174 

60026 

Downstream 

UHNhscpg0002091 

0.02724 

1 

1 

chrl  1 :3240591 7-32406464 

540 

7199 

UHNhscpg00021 63 

0.02736 

0.066 

8 

chrl  7:63708274-6370841 0 

137 

47329 

Downstream 

UHNhscpgOOOl  392 

0.0274 

1 

1 

chrl  8:321 15023-321 15237 

190 

16462 

Downstream 

UHNhscpg0002263 

0.02765 

0.296 

3 

chr5:1 689431 26-1 68943760 

615 

0 

Overlapping 

UHNhscpg0000239 

0.02789 

0.219 

2 

chr2:1 28331 843-1 283321 1 0 

260 

64 

UHNhscpgOOOl  51 8 

0.02797 

1 

1 

chrl  7:76480469-76480901 

421 

99388 

Downstream 

UHNhscpg0000537 

0.02798 

0 

15 

NA 

NA 

UHNhscpg0002041 

0.02811 

1 

1 

chrl  3:781 31 007-781 31 426 

400 

0 

Overlapping 

UHNhscpgOOOl  049 

0.02812 

1 

1 

chr9:1 1 8445890-1 1 844601 1 

118 

43323 

UHNhscpg0000594 

0.02828 

0.635 

2 

chr5:1 72588938-1 72589503 

550 

5365 

Upstream 

UHNhscpgOOOl  681 

0.02845 

0.913 

2 

chr8:1 1 0725288-1 1 0725809 

518 

208 

Downstream 

UHNhscpg0000618 

0.02856 

1 

1 

chrl  2:30739053-30739432 

380 

586 

Upstream 

UHNhscpg0000653 

0.02867 

1 

1 

chr6:31 97351 6-31 9741 44 

613 

73 

Downstream 

UHNhscpg0005500 

0.02886 

0 

13 

NA 

NA 

UHNhscpg0007227 

0.02896 

1 

1 

chrl  9:5004201 5-50042472 

458 

783 

UHNhscpgOOOl  121 

0.02899 

0 

44 

NA 

NA 

UHNhscpgOOOl  828 

0.02924 

0.774 

2 

chrl  7:1 7507496-1 7507756 

257 

17755 

Downstream 

UHNhscpg0004425 

0.02928 

1 

1 

chrl  7:56828620-56828930 

252 

3108 

Downstream 

UHNhscpgOOl  0504 

0.02939 

0.845 

2 

chrl  7:595771 94-59577547 

354 

15986 

Downstream 

UHNhscpgOOl  1809 

0.02948 

0 

47 

NA 

NA 

UHNhscpgOOOl  051 

0.02949 

0 

43 

NA 

NA 

UHNhscpg0004022 

0.02978 

1 

1 

chr3:25253404-25253585 

182 

191172 

Downstream 

UHNhscpg0007487 

0.02995 

1 

1 

chrl  8:50049685-5005031 3 

617 

0 

UHNhscpgOOOl  443 

0.02996 

0.325 

2 

chr2:1 76842521  -1 76842785 

197 

139 

Upstream 

UHNhscpg0007328 

0.03004 

0.106 

3 

chrl  :75374734-75375251 

518 

8028 

99HE 

UHNhscpg0000475 

0.03017 

1 

1 

chr7:990521 15-99052926 

621 

0 

Overlapping 

UHNhscpg0002255 

0.03042 

0 

13 

NA 

NA 

UHNhscpg000901 4 

0.03049 

0.358 

6 

chrl  1:79651 50-796551 7 

360 

0 

UHNhscpgOOOl  960 

0.03076 

1 

1 

chr8:1 01 928783-1 01 929400 

602 

1 04047 

UHNhscpg0005607 

0.03079 

1 

1 

chrl  8:72335447-72335597 

141 

195376 

Downstream 

UHNhscpg0007898 

0.03103 

1 

1 

chr3:239341 08-23934307 

153 

466 

UHNhscpg0008726 

0.03138 

1 

1 

chr4:1 981 480-1 981 586 

107 

933 

Downstream 

UHNhscpg0007508 

0.03139 

0 

12 

chrl  6:33863882-33864008 

86 

5996 

■Mu® 

UHNhscpg000041 2 

0.03167 

0.832 

2 

chrl  9:9980882-9981 159 

238 

988 

■Si 

UHNhscpgOOl  0798 

0.03172 

1 

1 

chrl  9:50703548-50703901 

346 

1021 

■m 

UHNhscpg0005699 

0.0319 

0.72 

2 

chr7:27226471  -27226888 

418 

20221 

Downstream 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpgOOOl  274 

0.03248 

1 

1 

chrl  1 :1 24241 472-12424201 0 

527 

981 

Upstream 

UHNhscpg0003954 

0.03252 

0 

9 

NA 

NA 

UHNhscpg000261 7 

0.03266 

0.494 

3 

chrl  :46956756-46957301 

542 

22 

Upstream 

UHNhscpg0006532 

0.03267 

1 

1 

chr4:1 781 2603-1 781 2754 

111 

180129 

Downstream 

UHNhscpg0006831 

0.03268 

1 

1 

chr2:1 69743028-1 697431 21 

94 

111238 

Upstream 

UHNhscpg0004677 

0.03283 

0 

39 

NA 

NA 

UHNhscpg0000722 

0.03288 

0 

13 

NA 

NA 

UHNhscpg0010009 

0.03314 

0 

13 

NA 

NA 

UHNhscpg00071 24 

0.0334 

0.015 

3 

chrl  2:9491 372-9491 683 

268 

160081 

Upstream 

UHNhscpg000631 9 

0.03377 

1 

1 

chrl  7:75392265-753931 44 

872 

6780 

Downstream 

UHNhscpg00091 18 

0.034 

1 

1 

chrl  4:361 23320-361 23588 

262 

1783 

Downstream 

UHNhscpg0008799 

0.03402 

1 

1 

chr2:66526261  -66526432 

172 

10226 

UHNhscpg0007040 

0.03403 

0.823 

2 

chr20:439731 92-43973549 

350 

644 

Upstream 

UHNhscpgOOl  1745 

0.03414 

0 

13 

NA 

NA 

UHNhscpgOOl  0375 

0.03421 

1 

1 

chr2:85051 032-85051 165 

134 

592 

Downstream 

UHNhscpg0000037 

0.03437 

0.04 

3 

chrl :  144537487- 144538576 

1090 

98405 

Msssinfli 

UHNhscpgOOOl  509 

0.03457 

0.033 

2 

chrl  :255361 25-2553661 5 

491 

782 

Downstream 

UHNhscpg0008984 

0.0346 

0.024 

4 

chr8:57286072-572861 68 

85 

224 

EwMiranil 

UHNhscpgOOOl  034 

0.03467 

1 

1 

chr5:1 72646777-1 72647068 

284 

42044 

Upstream 

UHNhscpg0009228 

0.03473 

0.613 

2 

chrl  7:1 41 39087-1 41 39267 

181 

5963 

Downstream 

UHNhscpg0002720 

0.03477 

1 

1 

chrl  :144106952-144107310 

355 

17324 

Downstream 

UHNhscpg00021 75 

0.0348 

1 

1 

chr3:1 2500629-1 2501 447 

743 

0 

UHNhscpg00041 93 

0.035 

1 

1 

chrl  1:731 67868-731 68609 

742 

7399 

Downstream 

UHNhscpg0006853 

0.035 

1 

1 

chr3:27731 181-27731384 

204 

7405 

IKBUnflE 

UHNhscpg0002783 

0.03517 

0.051 

3 

chr7:61 41 6908-6141 6950 

39 

2243969 

Upstream 

UHNhscpg0000567 

0.03527 

1 

1 

chr6:41 997067-41 997790 

712 

125 

■sum 

UHNhscpgOOOl  759 

0.03556 

0.622 

3 

chr4:1 2044091 6-1 20441 1 82 

259 

53 

Upstream 

UHNhscpg0002523 

0.03578 

0 

13 

NA 

NA 

UHNhscpg000301 9 

0.03581 

1 

1 

chr22:30769207-30769490 

276 

0 

Overlapping 

UHNhscpg000031 1 

0.03604 

0.952 

4 

chr8:1 04495884-1 04497258 

1360 

0 

UHNhscpg0002250 

0.03612 

1 

1 

chr6:1 3421 63-1 34251 2 

342 

7096 

Upstream 

UHNhscpg0000461 

0.03659 

1 

1 

chr6:1 44458000-1 4445871 8 

703 

0 

UHNhscpg0009957 

0.03675 

0 

52 

NA 

NA 

UHNhscpg0009207 

0.0368 

0.092 

3 

chrl  9:51 61 1 502-51 611610 

109 

2821 

Downstream 

UHNhscpg0000467 

0.03682 

0.661 

2 

chrl  9:4059857-4060284 

416 

14842 

Upstream 

UHNhscpg0008485 

0.03683 

1 

1 

chr7:99536509-99537172 

644 

0 

UHNhscpgOOOl  559 

0.03687 

0.007 

3 

chrl  :1 1 71 9003-1 171 9321 

304 

275 

Upstream 

UHNhscpg0000908 

0.037 

0.661 

2 

chr2:1 5649057-1 5649583 

493 

0 

UHNhscpg000071 2 

0.03702 

0.531 

27 

chrl  :1 47490240-1 47490606 

303 

64371 

Downstream 

UHNhscpg0000071 

0.03758 

0.638 

4 

chr2:8551 7938-8551 8249 

312 

2510 

UHNhscpg0005989 

0.03775 

1 

1 

chr6:71 049050-71 04971 9 

670 

0 

Overlapping 

UHNhscpgOOOl  170 

0.03787 

1 

1 

chrl  5:93894702-93894930 

229 

780019 

Downstream 

UHNhscpg0007290 

0.038 

0.586 

2 

chr5:1 48705064-1 48705229 

162 

0 

Overlapping 

UHNhscpg0000991 

0.03823 

0 

66 

NA 

NA 

UHNhscpg0003024 

0.0387 

1 

1 

chrl  9:58388353-58388523 

159 

0 

Overlapping 

UHNhscpg0000427 

0.03874 

1 

1 

chr4:1 28773336-1 28773809 

462 

0 

UHNhscpg0007080 

0.03902 

0.8 

2 

chrl  1 :442891 00-44289535 

420 

808 

Downstream 

UHNhscpgOOl  1727 

0.0391 

1 

1 

chrl  8:23362839-23362920 

82 

343560 

Downstream 

UHNhscpg0002688 

0.03923 

0.586 

4 

chr8:5821 8508-5821 8861 

350 

149551 

Downstream 

UHNhscpg000261 9 

0.03942 

0 

13 

NA 

NA 

UHNhscpg0002287 

0.03951 

0.103 

3 

chr3:1 5930601 1  -1 59306237 

223 

393 

UHNhscpg0002232 

0.03955 

1 

1 

chr4:41 631 740-41 63241 3 

630 

0 

UHNhscpgOOOl  81 2 

0.03963 

0.728 

2 

chr6:1 071 83890-1071 841 54 

265 

0 

UHNhscpgOOl  0967 

0.03976 

0.657 

3 

chr5:1 322301 56-1 32230697 

522 

0 

UHNhscpg0002424 

0.03977 

1 

1 

chr4:41 631 740-41 63241 3 

637 

0 

UHNhscpg0002498 

0.03981 

1 

1 

chr7:291 51 988-291 52700 

689 

0 

UHNhscpgOOOl  181 

0.04024 

0.908 

6 

chrl  9:49308930-49309510 

573 

0 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpg0007027 

0.04046 

1 

1 

chr4:1 331 9471 1-133194772 

62 

1095147 

Downstream 

UHNhscpg0001446 

0.0405 

1 

1 

chr9:1 39550967-1 39551 440 

466 

13217 

jgwMiraillf 

UHNhscpg0002583 

0.04072 

0.738 

4 

chr9:1 35272274-1 35273404 

1098 

0 

Overlapping 

UHNhscpg0006063 

0.04078 

1 

1 

chr5:1 77344333-1 77344745 

409 

11104 

UHNhscpg0009352 

0.04112 

0.353 

2 

chr6:53331 731 -53332080 

329 

9830 

Downstream 

UHNhscpgOOl  0654 

0.04112 

1 

1 

chr1:1 13609455-1 13609661 

207 

125232 

Downstream 

UHNhscpg0002671 

0.04115 

0.701 

2 

chr9:351 06261 -351 06934 

662 

368 

Downstream 

UHNhscpg0000345 

0.04138 

0.701 

4 

chrl  9:6171 061 5-6171 1261 

643 

0 

UHNhscpg0001956 

0.04141 

1 

1 

chr9:70925408-70925639 

228 

53269 

Downstream 

UHNhscpgOOOl  599 

0.04142 

1 

1 

chr8:5691 9645-56920023 

363 

34902 

Downstream 

UHNhscpg0007760 

0.04142 

1 

1 

chrl  8:72099907-721 00284 

358 

430689 

Downstream 

UHNhscpgOOl  0529 

0.04178 

1 

1 

chr4:24625820-24625958 

135 

15455 

fa^u.Miranit 

UHNhscpg0004076 

0.0418 

1 

1 

chr3:201 37200-201 3731 3 

114 

65374 

Upstream 

UHNhscpg0002171 

0.04187 

0.154 

2 

chrl  9:1 37461 70-1 3746588 

395 

0 

UHNhscpg0002431 

0.04193 

0.96 

2 

chr8:1 2881 9638-1 28820520 

871 

2141 

Upstream 

UHNhscpg0002925 

0.04196 

1 

1 

chrl  8:4461 5052-4461 5385 

330 

115694 

UHNhscpg0004653 

0.04196 

1 

1 

chr2:21 8940536-21 8941 1 48 

505 

10714 

Upstream 

UHNhscpg0007377 

0.0421 

0 

13 

NA 

NA 

UHNhscpg00091 40 

0.0422 

1 

1 

chr3:1 31 175815-131 175943 

125 

309 

Downstream 

UHNhscpgOOOl  496 

0.04229 

0.588 

3 

chr9:83397626-833981 80 

514 

95236 

UHNhscpgOOl  161 5 

0.04245 

1 

1 

chr2:1 00004565-1 00004600 

36 

83877 

Upstream 

UHNhscpgOOl  1499 

0.04246 

0 

35 

NA 

NA 

UHNhscpgOOOl  557 

0.04292 

1 

1 

chr2:37053-37637 

573 

217234 

Downstream 

UHNhscpg0007029 

0.04293 

0.648 

13 

chrl  5:76023501  -760241 60 

644 

132752 

mBunK 

UHNhscpg0005936 

0.04297 

1 

1 

chr21 :46905649-46905872 

224 

25695 

Upstream 

UHNhscpgOOOl  684 

0.04298 

1 

1 

chr5:1 261 70332-1 261 70372 

41 

29601 

UHNhscpg0005235 

0.04298 

0 

35 

NA 

NA 

UHNhscpg000881 9 

0.0433 

0.812 

2 

chr21 :37274394-37274976 

579 

9397 

UHNhscpg0006842 

0.0434 

0.409 

2 

chr2:66655720-66655966 

247 

139685 

Upstream 

UHNhscpg0000304 

0.04359 

0.217 

30 

chr6:26231 71 2-26231 909 

198 

202 

tumunfli: 

UHNhscpg0007532 

0.04367 

1 

1 

chrl  8:72099905-721 00286 

382 

430687 

Downstream 

UHNhscpg0000928 

0.04373 

1 

1 

chrl  8:321 15023-321 15237 

190 

16462 

Downstream 

UHNhscpgOOOl  767 

0.04378 

0 

13 

NA 

NA 

UHNhscpg0008596 

0.04378 

0.665 

2 

chrl  6:29541 54-2954698 

534 

0 

UHNhscpg0000507 

0.04387 

1 

1 

chrl  7:261 82897-261 83794 

868 

0 

Overlapping 

UHNhscpg0007226 

0.04421 

0 

4 

chr6:26829529-26829956 

428 

61788 

Downstream 

UHNhscpgOOOl  135 

0.04433 

0.799 

4 

chrl  9:49220922-49221 654 

725 

0 

Overlapping 

UHNhscpg0009141 

0.0448 

1 

1 

chr4:1 0293061 4-1 02931 464 

851 

0 

UHNhscpg0000090 

0.04487 

0.627 

2 

chr8:1 01 391 596-101392235 

606 

93 

Downstream 

UHNhscpgOOl  0678 

0.04496 

1 

1 

chr5:1 26305083-1 263051 46 

60 

88999 

UHNhscpg0000919 

0.04501 

1 

1 

chr5:1 40709706-1 4071 0379 

650 

0 

Overlapping 

UHNhscpg0002267 

0.04507 

1 

1 

chr7:91 347862-91 348424 

559 

0 

UHNhscpg0004664 

0.04534 

0.568 

2 

chrl  :45822221 -45822486 

266 

0 

Overlapping 

UHNhscpg0008339 

0.04534 

1 

1 

chr6:431 35442-431 36002 

553 

0 

UHNhscpg00041 49 

0.04545 

0 

34 

NA 

NA 

UHNhscpgOOOl  796 

0.04549 

0.922 

2 

chr3:44664988-44665757 

766 

0 

UHNhscpg0002797 

0.0456 

0 

4 

chr6:26829522-26829956 

431 

61781 

Downstream 

UHNhscpg00081 25 

0.0456 

0.741 

2 

chrl  :51 206642-51 206811 

170 

143 

Downstream 

UHNhscpg0007632 

0.04593 

1 

1 

chr3:3851 261 5-3851 31 27 

513 

0 

UHNhscpg00061 03 

0.04606 

1 

1 

chrl  1:15628805-15629035 

231 

535746 

UHNhscpg0008837 

0.04606 

1 

1 

chrl  5:206381 28-20638397 

266 

251 

Downstream 

UHNhscpg0007720 

0.04609 

1 

1 

chrl  :94085305-94085670 

264 

31660 

UHNhscpg0004534 

0.04644 

0 

4 

chrl  :1 48079924-1 48080282 

347 

535 

Downstream 

UHNhscpg0009990 

0.04657 

1 

1 

chr7 :43932064-43932260 

197 

261 

UHNhscpg0006327 

0.04683 

1 

1 

chr2:21 1 025765-21 1 025896 

132 

23780 

M*!  1  UN  1 

UHNhscpg0007529 

0.04688 

0 

13 

NA 

NA 

UHNhscpgOOl  1699 

0.0469 

1 

1 

chrl  5:83775792-83775966 

171 

50918 

BW  1  ili  liiW 

P  Values 


Sequence  ID 

Myc 

Specificity 

Score 

Total 

Hits 

Location 

Length 

Genel 

Distance 

Genel  Direction 

UHNhscpg0009281 

0.04694 

0 

4 

chrl  :1 48070730-1 48071 003 

262 

0 

Overlapping 

UHNhscpgOOl  1367 

0.04699 

1 

1 

chrl  3:67622766-67622831 

58 

920302 

Downstream 

UHNhscpg0002979 

0.04706 

1 

1 

chrl  7:1 534808-1 535275 

389 

0 

Overlapping 

UHNhscpg0003746 

0.04723 

1 

1 

chrl  2:551 67738-551 68353 

616 

95 

UHNhscpg0006082 

0.04726 

0.444 

2 

chr9:88752268-88752501 

234 

344 

Downstream 

UHNhscpg0006698 

0.04739 

0.632 

4 

chr8:65656638-65656998 

361 

1271 

UHNhscpg0001885 

0.04762 

1 

1 

chrl  5:99009624-9901 031 4 

651 

49267 

Upstream 

UHNhscpg0007131 

0.04773 

0.746 

2 

chr20:439731 92-43973549 

287 

644 

UHNhscpgOOl  071 9 

0.04774 

1 

1 

chr6:82520050-82520703 

654 

903 

Downstream 

UHNhscpg0004253 

0.04794 

0 

35 

NA 

NA 

UHNhscpgOOl  0955 

0.04809 

0.189 

41 

chrl  :1 71 03856-1 71 04232 

217 

16799 

Downstream 

UHNhscpg0002647 

0.04816 

0.942 

2 

chr4:1 09760868-1 09761 763 

868 

0 

UHNhscpg0007824 

0.04826 

0 

4 

chrl  5:41 669082-41 669582 

501 

2961 

Downstream 

UHNhscpg0007068 

0.04836 

1 

1 

chrl  :1 1 7403485-1 1 7404038 

353 

433 

Downstream 

UHNhscpg0004050 

0.04865 

0 

9 

NA 

NA 

UHNhscpgOOl  1201 

0.04871 

0.613 

2 

chr5:87992642-87993052 

411 

221728 

UHNhscpg0002481 

0.04901 

1 

1 

chrl  :1 73258979-1 73259568 

550 

0 

Overlapping 

UHNhscpgOOOl  494 

0.04904 

0.609 

2 

chr3:47841 476-47842283 

764 

0 

UHNhscpgOOOl  520 

0.04912 

0.114 

2 

chrl  0:1 24886983-1 24887034 

44 

10593 

Downstream 

UHNhscpgOOl  101 3 

0.04939 

1 

1 

chr3:820831 00-82083256 

157 

189665 

Downstream 

UHNhscpg0000880 

0.04948 

0.456 

9 

chrl  :67668277-67668926 

599 

0 

Overlapping 

UHNhscpgOOl  1339 

0.04952 

1 

1 

chr5:34721 601 -34721 739 

131 

29327 

HSSSBBI 

Best  Hit 


Genel  Symbol  Genel  ID 


UQCRFS1 


AMZ1 


TLE1 


KIAA0090 


TRAPPC6A 


METTL1 


UQCRFS1 


FAM90A1 


DUS3L 


CSDA 


SMARCA2 


ZNF225 


DAP 


DUS3L 


ZNF557 


SNRPB 


ZNF430 


RPL23A 


KBTBD7 


GDAP2 


C20orf39 


NA 


RLN2 


LOC441426 


RPL34 


RPS16 


DDX1 


DAP 


SMARCA2 


SGOL1 


TMEM16E 


NA 


ZNF552 


SERPINB9 


HOXB2 


MANE  A 


LOC642265 


SIN3A 


PTBP1 


UBE2I 


BCLAF1 


TRIM26 


TUT1 


ZNF518 


EXOD1 


INTS7 


UBE2D2 


GNB2L1 


ANKMY2 


NA 


CBX4 


DAP 


WRB 


DHX37 


PAK1IP1 


7386 


155185 


7088 


23065 


79090 


4234 


7386 


55138 


56931 


8531 


6595 


7768 


1611 


56931 


79230 


6628 


80264 


6147 


84078 


54834 


79953 


6019 


441426 


6164 


6217 


1653 


1611 


6595 


151648 


203859 


79818 


5272 


3212 


79694 


642265 


25942 


5725 


7329 


9774 


7726 


64852 


9849 


112479 


25896 


7322 


10399 


57037 


8535 


1611 


7485 


57647 


55003 


Gene2 

Distance 


1812097 


103226 


395328 


0 


259 


329 


1812121 


54582 


5855 


47849 


463671 


18594 


527773 


6454 


49473 


38741 


61247 


2003 


61340 


387 


265312 


35390 


1218276 


0 


9456 


30118 


527773 


605949 


65374 


144091 


29815 


62155 


12026 


438545 


82300 


85104 


22041 


1035 


39619 


44995 


10080 


86987 


0 


0 


77935 


17012 


697 


40562 


527773 


31110 


4362 


27903 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


Upstream 


Downstream 


Overlapping 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Upstream 


Downstream 


Downstream 


Upstream 


Overlapping 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


Downstream 


Downstream 


POP4 


TTYH3 


FLJ46321 


C1orf33 


BLOC1S3 


METTL1 


POP4 


ZNF705A 


MGC24975 


STYK1 


VLDLR 


ZNF224 


ROPN1L 


MGC24975 


MBD3L2 


ZNF343 


ZNF714 


RAB34 


KBTBD6 


WDR3 


CST7 


NA 


RLN1 


ANKRD20A3 


RPL34 


SUPT5H 


NAG 


ROPN1L 


VLDLR 


SGOL1 


SLC17A6 


NA 


ZNF587 


SERPINB6 


HOXB1 


FUT9 


FAM74A1 


MAN2C1 


PRG2 


UBE2I 


FAM54A 


FLJ45422 


MTA2 


CCNJ 


LOC81691 


DTL 


TMEM173 


TRIM52 


BZW2 


NA 


CBX8 


ROPN1L 


HMGN1 


BRI3BP 


TMEM14C 


10775 


80727 


389763 


51154 


388552 


4234 


10775 


440077 


163154 


55359 


7436 


7767 


83853 


163154 


125997 


79175 


148206 


83871 


89890 


10885 


8530 


6013 


441425 


6164 


6829 


51594 


83853 


7436 


151648 


57084 


84914 


5269 


3211 


10690 


401507 


4123 


79948 


7329 


113115 


441140 


9219 


54619 


81691 


51514 


340061 


84851 


28969 


57332 


83853 


3150 


140707 


51522 


Best  E- 
Box 


NA 


0.001 


1 


0.804 


0.005 


0.404 


NA 


0 


0.011 


0.574 


0.069 


NA 


0.006 


0 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


0 


NA 


0 


0.006 


NA 


NA 


0 


NA 


0.001 


NA 


NA 


NA 


0 


NA 


NA 


NA 


NA 


0.001 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


1 


0.006 


NA 


0.062 


0.012 


Best  Hit 


Genel  Symbol  Genel  ID 


Gene2 

Distance 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


ZNF708 


NA 


CLASP1 


SERPINB9 


NUP155 


PHLPP 


DDX11 


MAP3K6 


CCRN4L 


USP48 


MDN1 


RPS27L 


LOC441242 


SGOL2 


BAI2 


RPL38 


NA 


MTX3 


ZNF222 


ELF1 


MLL5 


PROK2 


LOC441426 


ZNF322A 


TCERG1 


TEGT 


LDLR 


MDN1 


CBX5 


ERCC1 


PTP4A2 


MTHFSD 


SF3B5 


ZNF35 


Cllorfl 


GPATC8 


CAPG 


TUT1 


KIAA0495 


LOC441426 


PCDHGB3 


C19orf53 


PFDN5 


LOCI  62993 


MIDI IP1 


NA 


PRCD 


BARX1 


POLR1B 


WDR45L 


ALDH7A1 


UCK2 


NA 


UQCRFS1 


RAX 


COPS4 


PLA2G1B 


7562  37592 


23332 


5272 


9631 


23239 


1663 


9064 


25819 


84196 


23195 


51065 


441242 


151246 


576 


6169 


345778 


7673 


1997 


55904 


60675 


441426 


79692 


10915 


7009 


3949 


23195 


23468 


2067 


8073 


64779 


83443 


7584 


64776 


23131 


822 


64852 


57212 


441426 


56102 


28974 


5204 


1 62993 


58526 


768206 


56033 


84172 


56270 


501 


7371 


206290 


62155 


0 


1 75272 


251901 


8928 


68863 


385 


9913 


31786 


102113 


16122 


55019 


9029 


43790 


22006 


131747 


255667 


345495 


1218276 


148399 


68570 


34650 


66029 


9909 


413 


1728 


83836 


12212 


54643 


23472 


7216 


53781 


13381 


10076 


4745 


1218276 


0 


9834 


3514 


49985 


239743 


19717 


84587 


42169 


84226 


5438 


59352 


Downstream 


NA 


Upstream 


Overlapping 


Upstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Upstream 


Downstream 


NA 


Downstream 


Upstream 


Upstream 


Downstream 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Overlapping 


Downstream 


Upstream 


NA 


Downstream 


Downstream 


Downstream 


7386  1812097 


30062  45823 


51138  24019 


5319  14847 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


ZNF493  284443 


NA 


MKI67IP  84365 


SERPINB6  5269 


NUP155  9631 


ZCCHC2  54877 


FAM60A  58516 


FCN3  8547 


ELF2  1998 


USP48  84196 


CASP8AP2  9994 


RAB8B  51762 


VKORC1L1  154807 


KCTD18  130535 


SPOCD1  90853 


TTYH2  94015 


NA 


THBS4  7060 


ZNF230  7773 


SLC25A15  10166 


SRPK2  6733 


RYBP  23429 


ANKRD20A3  441425 


ABT1  29777 


GPR151  134391 


FMNL3  91010 


SPBC24  147841 


CASP8AP2  9994 


HNRPA1  3178 


ERCC1  2067 


KHDRBS1  10657 


FOXC2  2303 


STX1 1  8676 


ZNF197  10168 


ALG9  79796 


FZD2  2535 


SH2D6  284948 


MTA2  9219 


CCDC27  148870 


ANKRD20A3  441425 


PCDHGB3  56102 


MGC3207  84245 


C12orf10  60314 


FBXL12  54850 


TSPAN7  7102 


NA 


CYGB  114757 


PTPDC1  138639 


CHCHD5  84269 


FOXK2  3607 


RNUXA  51808 


TMCOI  54499 


NA 


POP4  10775 


CPLX4  339302 


SIRT4 


23409 


Best  E- 
Box 


NA 


NA 


0.001 


NA 


NA 


0 


0 


NA 


0.001 


NA 


0.581 


NA 


0 


NA 


NA 


NA 


NA 


0.946 


NA 


NA 


0.008 


NA 


NA 


NA 


0.999 


NA 


NA 


0.581 


0.973 


0.003 


NA 


0 


0 


NA 


0.001 


NA 


NA 


NA 


0 


NA 


0 


0 


0.001 


NA 


NA 


NA 


NA 


NA 


0.998 


0.001 


0 


NA 


NA 


NA 


0.61 


0 


0.86 


Best  Hit 


Genel  Symbol  Genel  ID 


KIAA1539 


STAB2 


NA 


NA 


HIST2H4A 


TNP02 


C6orf62 


HSBP1 


TBRG4 


MDH1B 


XRCC6 


NA 


FLJ40722 


PKIB 


SCFD2 


R0B03 


ESD 


SLC5A7 


IWS1 


SPG20 


POP5 


RWDD1 


KLHL18 


TPRKB 


MEA1 


ZNF552 


IARS 


GCAT 


RPS27L 


ZNF616 


C8orf37 


CDC7 


B3GALT2 


IQGAP3 


NA 


RPS5 


FBX05 


NA 


CCDC38 


ARHGAP12 


DDX1 


DIRAS3 


IQGAP3 


SIX3 


CD3EAP 


HSBP1 


EED 


ARL4D 


NA 


RPLP1 


MTPN 


HD 


CDC14A 


ERCC1 


ABCF1 


NA 


TBC1D22A 


80256 


55576 


8370 


30000 


81688 


3281 


9238 


130752 


2547 


285966 


5570 


1 52579 


64221 


2098 


60482 


55677 


23111 


51367 


51389 


23276 


51002 


4201 


79818 


3376 


23464 


51065 


90317 


157657 


8317 


8707 


128239 


6193 


26271 


120935 


94134 


1653 


9077 


128239 


6496 


10849 


3281 


8726 


379 


6176 


136319 


3064 


8556 


2067 


23 


25771 


Gene2 

Distance 


13107 


75173 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


0 


10443 


53368 


610006 


45685 


887 


550 


50346 


138060 


11601 


30941 


99301 


648289 


107481 


83241 


0 


41862 


11 


35456 


0 


29815 


0 


2304 


31786 


49924 


135234 


99585 


132329 


19004 


21454 


18823 


677 


128113 


30109 


181716 


19004 


81041 


1354 


610006 


56857 


73930 


1 79758 


228112 


111847 


919 


626 


13920 


23845 


Downstream 


Downstream 


NA 


NA 


Overlapping 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


Upstream 


Downstream 


Upstream 


Downstream 


Upstream 


Downstream 


Overlapping 


Downstream 


Downstream 


Overlapping 


Downstream 


Overlapping 


Downstream 


Downstream 


Upstream 


Downstream 


Upstream 


Downstream 


NA 


Downstream 


Upstream 


NA 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


Downstream 


STOML2 


C12orf42 


NA 


NA 


H4/o 


C19orf43 


GMNN 


CDH13 


RAMP3 


FASTKD2 


D15Wsu75e 


NA 


LOC441294 


PKIB 


FIP1L1 


ROB04 


HTR2A 


SULT1C3 


PROC 


SOHLH2 


POP5 


C6orf78 


KIF9 


NAT8B 


KLHDC3 


ZNF587 


IARS 


HI  F0 


RAB8B 


PPP2R1A 


PLEKHF2 


HFM1 


CDC73 


APOA1BP 


NA 


ZNF584 


MTRF1L 


NA 


AMDHD1 


KIF5B 


NAG 


GPR177 


APOA1BP 


SIX2 


PPP1R13L 


CDH13 


C11orf73 


TMEM106A 


NA 


KIF23 


UNQ1940 


GRK4 


CDC14A 


ERCC1 


GNL1 


NA 


CERK 


30968 


374470 


554313 


79002 


51053 


1012 


10268 


22868 


27351 


441294 


5570 


81608 


54538 


3356 


442038 


5624 


54937 


51367 


221301 


64147 


51471 


116138 


84914 


3376 


3005 


51762 


5518 


79666 


1 64045 


79577 


128240 


201514 


54516 


144193 


3799 


51594 


79971 


128240 


10736 


10848 


1012 


51501 


113277 


9493 


389558 


2868 


8556 


2067 


2794 


64781 


Best  E- 
Box 


1 


NA 


NA 


NA 


NA 


0.03 


0 


0 


0 


0 


NA 


NA 


0.001 


0 


0.976 


0 


0 


NA 


NA 


0.004 


0.999 


NA 


1 


NA 


0.558 


0.001 


0 


NA 


NA 


0 


NA 


NA 


NA 


NA 


NA 


NA 


0.022 


NA 


0 


NA 


0 


0.997 


NA 


NA 


0.024 


0 


NA 


NA 


NA 


0.001 


NA 


NA 


NA 


NA 


0 


NA 


0.004 


Best  Hit 


26083 


4968 


3659 


54331 


22903 


10780 


1 53364 


55672 


1131 


2115 


140739 


57496 


79694 


7572 


63894 


83861 


Genel  Symbol  Genel  ID 


NA 


HIST1H2BL 


NA 


DKFZP434O047 


OGGI 


IRF1 


GNG2 


BTBD3 


NA 


ZNF234 


LOCI  53364 


NBPF1 


NA 


CHRM3 


ETV1 


UBE2F 


MKL2 


MANE  A 


NA 


INTS7 


C9orf123 


PRDM1 


ZNF24 


C14orf133 


RSHL2 


NA 


RAB37 


NA 


PYDC1 


NA 


SLC26A6 


RPS27L 


NA 


OR4D9 


ESD 


DHX16 


FLJ90650 


ZNF559 


ESD 


SP8 


PLCB4 


ZIC2 


DKFZp666G057 


ZNF17 


NA 


MGC5081 1 


SMN2 


VDP 


ATAD1 


FLCN 


TBC1D22A 


TNFAIP3 


GALR1 


RPL21 


NA 


ZMYM1 


POLR3A 


Gene2 

Distance 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


NA 


Downstream 


NA 


123016 


183 


53175 


128791 


588787 


23147 


270 


74089 


704733 


851654 


94093 


151537 


438545 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


Downstream 


NA 


Downstream 


Upstream 


Downstream 


Upstream 


Downstream 


NA 


90871  1041642 


639  1 04442 


32816 


23 


44740 


65010 


51065 


390199 


2098 


8449 


206338 


84527 


2098 


221833 


5332 


7546 


283726 


7565 


375307 


6607 


8615 


84896 


201163 


25771 


7128 


2587 


6144 


79830 


11128 


1030 


31786 


47485 


99301 


17813 


111544 


18892 


99301 


130145 


446539 


9502 


65 


21177 


13847 


24377 


50950 


45446 


426 


23845 


239212 


102127 


18795 


47212 


4203 


Upstream 


Downstream 


Upstream 


Downstream 


Upstream 


NA 


Downstream 


NA 


Upstream 


NA 


Downstream 


Downstream 


NA 


Upstream 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


NA 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


NA 


HIST1H2AI 


NA 


GOSR1 


OGGI 


IL5 


C14orf166 


BTBD3 


NA 


ZNF226 


POLR3G 


NBPF10 


NA 


FMN2 


DGKB 


SCLY 


ERCC4 


FUT9 


NA 


DTL 


JMJD2C 


PRDM1 


ZNF396 


AHSA1 


TAGAP 


NA 


CD300E 


NA 


TRIM72 


NA 


SLC26A6 


RAB8B 


NA 


OR4D11 


HTR2A 


NRM 


AP3S1 


ZNF699 


HTR2A 


ABCB5 


C20orf103 


ZIC5 


EFTUD1 


ZNF548 


NA 


SLC11A1 


SERF1A 


G3BP2 


PTEN 


FLCN 


CERK 


PERP 


MBP 


RASL11A 


NA 


ZMYM6 


RPS24 


9527 


4968 


3567 


51637 


22903 


7769 


10622 


440673 


56776 


1607 


51540 


2072 


10690 


51514 


23081 


639 


252884 


10598 


1 1 7289 


65010 


51762 


219986 


3356 


11270 


1176 


374879 


3356 


340273 


24141 


85416 


79631 


147694 


6556 


8293 


9908 


5728 


201163 


64781 


64065 


4155 


387496 


Best  E- 
Box 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


0 


1 


NA 


NA 


NA 


NA 


0 


NA 


NA 


NA 


NA 


0 


NA 


0.002 


NA 


0.003 


NA 


NA 


NA 


0 


NA 


0.001 


NA 


NA 


0 


0 


NA 


NA 


0 


0 


NA 


NA 


NA 


0 


0 


NA 


NA 


NA 


0.539 


NA 


NA 


0.004 


NA 


0.001 


NA 


NA 


NA 


0.266 


Best  Hit 


Genel  Symbol 


FLJ2151 1 


SERBP1 


BRWD2 


NEUROD2 


BNC1 


SMN2 


TUT1 


AP2A2 


NA 


IQGAP3 


TRAF7 


PGD 


COL9A1 


KBTBD6 


KIAA0143 


UHMK1 


ATP6V1G1 


ZNF184 


KIAA0182 


ASPHD2 


CDS1 


CLNS1A 


PET112L 


FXR1 


DDX23 


NBPF1 


GUSBL2 


NBPF1 


GAP43 


RG9MTD2 


SLC1A5 


TUT1 


CD44 


MPHOSPH1 


SLC5A7 


ZNF430 


MCEE 


C5orf35 


RPL21 


MYC 


ARL6IP1 


NA 


ASPHD2 


NA 


SPRYD4 


FKSG30 


LPHN2 


C8orf37 


OXR1 


BCAT2 


MYC 


RPL28 


CNN1 


NGLY1 


LRRC33 


NBPF1 


INTS7 


Gene2 

Genel  ID  .  Gene2  Direction  Gene2  Symbol  Gene2  ID 
Distance  1 


80157 


26135 


55717 


4761 


646 


6607 


64852 


161 


128239 


84231 


5226 


1297 


89890 


23167 


127933 


9550 


7738 


23199 


57168 


1040 


1207 


5188 


8087 


9416 


55672 


375513 


55672 


2596 


93587 


6510 


64852 


960 


9585 


60482 


80264 


84693 


133383 


6144 


4609 


23204 


605777 


122636 


381578 


52623 


81830 


24377 


10080 


59315 


19004 


6304 


30436 


19775 


61491 


576258 


63544 


23138 


130338 


134821 


1 1 8923 


84002 


47722 


306816 


76325 


13702 


131670 


145821 


74089 


510980 


10627 


41698 


10080 


222428 


564951 


648288 


61247 


52 


41391 


18795 


214454 


10734 


Downstream 


283377 


440915 


23266 


157657 


55074 


587 


4609 


6158 


1264 


55768 


375387 


55672 


25896 


13831 


48243 


2525506 


135222 


112681 


38940 


214457 


9071 


9362 


7161 


70593 


74089 


0 


Upstream 


Downstream 


Downstream 


Upstream 


Downstream 


NA 


Downstream 


Upstream 


Downstream 


Upstream 


Upstream 


Downstream 


Downstream 


Upstream 


Downstream 


Upstream 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


Upstream 


NA 


Downstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


OCIAD2 


IL12RB2 


C10orf85 


PPP1R1B 


SH3GL3 


SERF1A 


MTA2 


CHID1 


NA 


APOA1BP 


RAB26 


APITD1 


COL9A1 


KBTBD7 


KCNQ3 


UAP1 


C9orf91 


LOC346157 


GINS2 


HPS4 


NKX6-1 


AQP11 


FBXW7 


DNAJC19 


RND1 


LOC388692 


GUSBL2 


NBPF10 


ZBTB20 


MTTP 


STRN4 


MTA2 


PDHX 


PANK1 


SULT1C3 


ZNF714 


MPHOSPH10 


MIER3 


RASL11A 


TMEM75 


RPS15A 


NA 


HPS4 


NA 


MIP 


C2orf27 


TTLL7 


PLEKHF2 


ABRA 


FGF21 


TMEM75 


MDAC1 


ECSIT 


OXSM 


WDR53 


NBPF10 


DTL 


132299 


3595 


404216 


84152 


6457 


8293 


9219 


66005 


128240 


25837 


378708 


1297 


84078 


3786 


6675 


203197 


346157 


51659 


89781 


4825 


282679 


55294 


131118 


27289 


388692 


375513 


440673 


26137 


4547 


29888 


9219 


8050 


53354 


442038 


148206 


10199 


166968 


387496 


641384 


6210 


4284 


29798 


79739 


79666 


137735 


26291 


641384 


147744 


51295 


54995 


348793 


440673 


51514 


Best  E- 
Box 


0 


NA 


0 


0.001 


NA 


NA 


NA 


0 


NA 


NA 


0.998 


0.001 


NA 


NA 


0.006 


0 


0.179 


NA 


NA 


0.001 


NA 


0 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


0.008 


0 


NA 


0 


NA 


NA 


0.001 


NA 


0.034 


0.005 


NA 


NA 


NA 


NA 


0 


NA 


NA 


0 


NA 


NA 


NA 


Best  Hit 


Genel  Symbol  Genel  ID 


DDX50 


LOCI  341 45 


JAK2 


ZC3H15 


NA 


BCL7C 


NA 


MIDI IP1 


NPM3 


HELT 


NA 


NBPF8 


ATXN2L 


NA 


GULP1 


GRAP 


RNASEN 


JAG2 


ATP1A1 


Cllorfl 


ZNF790 


CXCR7 


FOXD2 


C6orf170 


KIAA1008 


FLJ38723 


NA 


CLDN23 


C9orf39 


NA 


ZNF675 


FLJ35258 


CDCA2 


MTDH 


CALM1 


DCTD 


LOC641522 


FIBP 


OAS3 


SOX14 


CXCR4 


NA 


TRIM15 


FBX033 


C7orf28B 


HFE2 


SORBS3 


LOC388692 


NA 


PRAMEF16 


DEFB131 


ZNF418 


KIAA1539 


NA 


ZNF786 


NA 


ST8SIA3 


79009 


134145 


3717 


55854 


58526 


10360 


391723 


641559 


11273 


51454 


10750 


29102 


3714 


476 


64776 


388536 


57007 


2306 


221322 


22894 


255180 


137075 


54875 


171392 


284297 


157313 


92140 


801 


1635 


641522 


9158 


4940 


8403 


7852 


89870 


254170 


221960 


148738 


10174 


388692 


654348 


644414 


147686 


80256 


Gene2 

Distance 


54306 


274 


191410 


103109 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


239739 


6818 


128244 


91643 


1222 


497357 


59030 


354 


11541 


152117 


7237 


11309 


246110 


29172 


100517 


0 


79242 


1 60249 


171875 


71565 


31740 


549 


131707 


51034 


256166 


65998 


1840 


31185 


230776 


648186 


8456 


165711 


176572 


113567 


9995 


85603 


38855 


21446 


12348 


13107 


Downstream 


Downstream 


Upstream 


Downstream 


NA 


Downstream 


NA 


Downstream 


Downstream 


NA 


Upstream 


NA 


Upstream 


Downstream 


Downstream 


Upstream 


Downstream 


Downstream 


Upstream 


Downstream 


Overlapping 


Downstream 


NA 


Downstream 


NA 


Upstream 


Downstream 


Downstream 


Downstream 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


NA 


Upstream 


Upstream 


Upstream 


Downstream 


NA 


Downstream 


Downstream 


NA 


51046  225337 


NA 


Downstream 


DDX21 


CCT5 


RCL1 


ITGAV 


NA 


CTF1 


NA 


TSPAN7 


FGF8 


SLC25A4 


NA 


NBPF11 


ATXN2L 


NA 


DIRC1 


FAM83G 


C5orf22 


NUDT14 


CD58 


ALG9 


ZNF345 


CXCR7 


FOXE3 


GJA1 


C13orf24 


NLF2 


NA 


MFHAS1 


BNC2 


NA 


ZNF681 


NAT  14 


KCTD9 


LAPTM4B 


C14orf102 


FLJ30277 


LRRC37A 


CCDC85B 


OAS1 


CLDN18 


CXCR4 


NA 


TRIM15 


CTAGE5 


LOC222967 


NOTCH2NL 


SORBS3 


NBPF1 


NA 


PRAMEF9 


LOC650293 


ZNF256 


STOML2 


NA 


ZNF425 


NA 


ONECUT2 


9188 


22948 


10171 


3685 


200030 


11273 


1 1 6093 


644815 


55322 


256281 


965 


79796 


25850 


57007 


2301 


2697 


10464 


388125 


9258 


54796 


148213 


57106 


54793 


55353 


55051 


152641 


9884 


11007 


4938 


51208 


7852 


89870 


4253 


222967 


388677 


10174 


55672 


343070 


650293 


10172 


30968 


Best  E- 
Box 


NA 


NA 


NA 


0.446 


NA 


0.266 


NA 


NA 


0.998 


NA 


NA 


0 


NA 


NA 


0 


NA 


NA 


NA 


NA 


0.001 


0.003 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


0 


NA 


NA 


NA 


NA 


0.077 


NA 


0 


NA 


NA 


0 


0.003 


0 


NA 


0 


0.728 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


1 


NA 


0 


NA 


NA 


Best  Hit 


Genel  Symbol  Genel  ID 


NA 


GPSN2 


PRDM14 


PRKAG2 


TBRG4 


ZNF429 


NA 


NA 


OXCT2 


NOL6 


CSTF3 


NA 


SOX9 


NA 


CHRNA7 


PCGF2 


RPS27 


IGSF22 


NSUN5B 


NA 


CD5 


WT1 


AMZ2 


FHOD3 


CCDC99 


POLR2D 


CHMP6 


NA 


C13orf7 


ASTN2 


NKX2-5 


FLJ20366 


IP08 


EHMT2 


NA 


PVRL2 


NA 


RAM 


TBX2 


ERN1 


NA 


NA 


RARB 


POLI 


MTX2 


LHX8 


ZNF498 


NA 


EIF3S5 


YWHAZ 


FLJ44881 


RPL15 


WHSC2 


LOC649159 


COL5A3 


VASP 


HOXA13 


9524 


63978 


51422 


9238 


353088 


64064 


65083 


1479 


1139 


7703 


6232 


283284 


155400 


921 


7490 


51321 


80206 


54908 


5433 


79643 


79596 


23245 


1482 


55638 


10526 


10919 


10743 


6909 


2081 


5915 


11201 


10651 


431707 


221785 


8665 


7534 


400661 


6138 


7469 


649159 


50509 


7408 


3209 


Gene2 

Distance 


10598 


199618 


60214 


45685 


1 08447 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


17245 


25637 


50 


375156 


5191 


4297 


22767 


46645 


70594 


9786 


90590 


93544 


53110 


26794 


142655 


55311 


43418 


84784 


1 03803 


59283 


3604 


71777 


14963 


15986 


427207 


44529 


80969 


403307 


57732 


23515 


1 05345 


329506 


567 


49798 


694867 


3765 


11395 


21800 


NA 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


NA 


Upstream 


Downstream 


Downstream 


NA 


Upstream 


NA 


Downstream 


Downstream 


Downstream 


Upstream 


NA 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


Upstream 


Upstream 


NA 


Downstream 


Downstream 


Downstream 


NA 


NA 


Upstream 


Downstream 


Upstream 


Upstream 


NA 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


NA 


DNAJB1 


SLC05A1 


PRKAG2 


RAMP3 


ZNF493 


NA 


NA 


BMP8B 


AQP3 


CSTF3 


NA 


SLC39A1 1 


NA 


OTUD7A 


PSMB3 


RAB13 


TMEM86A 


TRIM73 


NA 


CD6 


WIT  1 


SLC16A6 


MOCOS 


DOCK2 


MGC4268 


BAIAP2 


NA 


POU4F1 


TRIM32 


BNIP1 


EBAG9 


C1QDC1 


ZBTB12 


NA 


PVRL2 


NA 


PEMT 


Cl  7orf82 


ERN1 


NA 


NA 


TOP2B 


MBD2 


HOXD1 


TYW3 


ZNF655 


NA 


NALP10 


YWHAZ 


ZNF236 


NKIRAS1 


NAT8L 


TP53TG3 


RDH8 


RTN2 


EVX1 


3337 


81796 


51422 


10268 


284443 


161725 


5691 


5872 


144110 


375593 


923 


51352 


9120 


55034 


1794 


83607 


10458 


5457 


22954 


662 


9166 


65981 


221527 


10400 


388407 


2081 


7155 


8932 


3231 


127253 


79027 


338322 


7534 


7776 


28512 


339983 


24150 


50700 


6253 


2128 


Best  E- 
Box 


NA 


NA 


NA 


0 


0 


NA 


NA 


NA 


NA 


0.844 


NA 


NA 


NA 


NA 


0.041 


NA 


NA 


0.123 


1 


NA 


0 


0 


NA 


0.001 


NA 


NA 


0.986 


NA 


1 


NA 


NA 


1 


NA 


1 


NA 


NA 


NA 


0.011 


0.007 


0.999 


NA 


NA 


0.012 


0 


NA 


NA 


NA 


NA 


0.063 


0.001 


NA 


NA 


NA 


NA 


0 


NA 


NA 


Best  Hit 


Genel  Symbol  Genel  ID 


R0B03 


NA 


KIAA0494 


LCORL 


DHRS9 


NA 


NA 


NA 


KLRB1 


CBX8 


NKX2-8 


MEIS1 


PLTP 


NA 


KCMF1 


PDZK1 


TMEM50A 


PLAG1 


STC2 


HS3ST3B1 


HFE2 


TSEN2 


MRPL48 


EOMES 


ZNF680 


BYSL 


LOC401 152 


NA 


SLC5A1 


SLC25A32 


FOXF2 


SF3B5 


NA 


CCDC8 


MAP2K2 


AP4M1 


AGTRAP 


DDX1 


LOC388692 


SH2D6 


COL9A1 


NR2F2 


GRPEL2 


NA 


ZNF665 


INTU 


ALX4 


CHST9 


IMPAD1 


NA 


SHOX2 


TMEM33 


RTN4IP1 


UQCRQ 


TMEM33 


CPVL 


ZNF225 


64221 


Gene2 

Distance 


30941 


Gene2  Direction  Gene2  Symbol  Gene2  ID 


9813  50070 


254251  390981 


10170  113484 


3820 


57332 


26257 


4211 


5360 


56888 


5174 


23585 


5324 


8614 


9953 


148738 


80746 


51642 


8320 


340252 


705 


401152 


6523 


81034 


2295 


83443 


83987 


5605 


9179 


57085 


1653 


388692 


284948 


1297 


7026 


134266 


79788 


27152 


60529 


83539 


54928 


6474 


55161 


84816 


27089 


55161 


54504 


7768 


98729 


0 


84489 


54643 


Upstream 


NA 


Downstream 


221892 


25678 


64666 


951513 


19580 


64759 


110565 


64558 


700 


51909 


58212 


42628 


72146 


18019 


257932 


2346995 


212 


21584 


Upstream 


NA 


NA 


NA 


Downstream 


Downstream 


Downstream 


Upstream 


NA 


Upstream 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


Upstream 


NA 


Upstream 


Upstream 


Downstream 


NA 


55007 


42041 


191 


44636 


30152 


85603 


26751 


19775 


1232752 


12572 


19516 


97214 


215426 


648269 


697365 


2491 


46119 


0 


1780 


46119 


47945 


18594 


Downstream 


Upstream 


Downstream 


Downstream 


Downstream 


Upstream 


Downstream 


Downstream 


NA 


Downstream 


Downstream 


Upstream 


Downstream 


NA 


Downstream 


Downstream 


Downstream 


Downstream 


Downstream 


ROB04 


NA 


ATPAF1 


NCAPG 


DHRS9 


NA 


NA 


NA 


CLEC2D 


CBX2 


TITF1 


ETAA1 


PPGB 


NA 


TMSB10 


CD160 


RFID 


CHCHD7 


NKX2-5 


CDRT15 


TXNIP 


MKRN2 


RAB6A 


SLC4A7 


ZNF588 


TRFP 


FABP2 


NA 


YWHAH 


WDSOF1 


FOXQ1 


STX11 


NA 


FLJ 10781 


ZBTB7A 


MCM7 


C1orf187 


NAG 


NBPF1 


CAPG 


COL9A1 


SPATA8 


PCYOX1 L 


NA 


FLJ46385 


SLC25A31 


EXT2 


CDH2 


PENK 


NA 


SHOX2 


WDR21B 


QRSL1 


GDF9 


WDR21B 


CHN2 


ZNF224 


54538 


64756 


64151 


10170 


29121 


84733 


7080 


54465 


5476 


9168 


11126 


6007 


79145 


1482 


146822 


10628 


23609 


5870 


9497 


51427 


9477 


2169 


7533 


25879 


94234 


8676 


55228 


51341 


4176 


374946 


51594 


55672 


822 


1297 


145946 


78991 


390963 


83447 


2132 


1000 


5179 


6474 


285429 


55278 


2661 


285429 


1124 


7767 


Best  E- 
Box 


0 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


0 


NA 


NA 


NA 


NA 


0.052 


NA 


0.002 


NA 


0 


NA 


NA 


0.026 


0 


0 


NA 


0.652 


NA 


NA 


NA 


0.831 


NA 


0 


NA 


NA 


0 


0.987 


0.001 


0 


NA 


NA 


NA 


NA 


0 


NA 


NA 


0.001 


NA 


NA 


NA 


NA 


NA 


0.001 


NA 


NA 


0.001 


NA 


NA 


Genel  Symbol  Genel  ID 


PCDH10 


PNPLA7 


REX04 


PROP1 


ELOVL5 


MAG  13 


KIAA1539 


ZNF471 


TJP2 


LYN 


FLJ44881 


LGI2 


SGOL1 


C19orf53 


MYC 


SMAD7 


MGC5081 1 


NA 


TRH 


TLE1 


AFF3 


NA 


ACPI 


TBC1D2B 


PRMT2 


LMNB1 


NA 


HLCS 


MEIS1 


HIST1H2BC 


FLJ44881 


FHOD3 


NA 


KREMEN2 


C17orf41 


ZNF322A 


ZNF222 


BANK1 


RNF19 


3-Mar 


PCDHGB1 


MTERF 


NASP 


KLC4 


NA 


ZNF35 


ZNF322A 


CDKN2C 


ENDOGL1 


INSC 


NIPA1 


DNTTIP2 


HIST2H3C 


URG4 


LANCL1 


NA 


AKAP13 


57575 


375775 


57109 


5626 


60481 


260425 


80256 


57573 


9414 


4067 


400661 


55203 


151648 


28974 


4609 


4092 


375307 


52 


23102 


3275 


4001 


3141 


4211 


8347 


400661 


80206 


79412 


79915 


79692 


7673 


55024 


25897 


115123 


56104 


7978 


4678 


89953 


7584 


79692 


1031 


9941 


387755 


123606 


30836 


126961 


55665 


10314 


Gene2 

Distance 


1095147 


14724 


3536 


109201 


185710 


125517 


13107 


30060 


85245 


71206 


564819 


34896 


65374 


9834 


209631 


295628 


13847 


217258 


186773 


56186 


1 75020 


25756 


821979 


442 


564817 


93544 


4900 


7080 


124371 


22006 


443238 


6933 


1 64352 


0 


59703 


33137 


93 


23474 


124364 


1418 


41822 


538260 


52279 


61930 


535 


311 


103686 


11214  245305 


Gene2  Direction 

Gene2  Symbol 

Downstream 

PCDH10 

Downstream 

MRPL41 

Downstream 

ADAMTS13 

MSsSRjHI 

THOC3 

Upstream 

GCLC 

Downstream 

MAGI3 

Downstream 

STOML2 

Downstream 

ZNF667 

Upstream 

FXN 

Downstream 

TMEM68 

Downstream 

ZNF236 

Downstream 

Upstream 

SGOL1 

■sssbm 

MGC3207 

Upstream 

TMEM75 

IHSsSBH 

KIAA0427 

Downstream 

SLC11A1 

NA 

NA 

Downstream 

ARVP6125 

Downstream 

FLJ46321 

Upstream 

AFF3 

NA 

NA 

Downstream 

ACPI 

KIsSBm 

CIB2 

Downstream 

S100B 

■SSSBjH 

FLJ27505 

NA 

NA 

Downstream 

DSCR6 

Downstream 

ETAA1 

Downstream 

HIST1H2AC 

Downstream 

ZNF236 

MSsSBSi 

MOCOS 

NA 

NA 

Downstream 

PAQR4 

Downstream 

CRLF3 

MSsSRjBI 

ABT1 

Upstream 

ZNF230 

Downstream 

PPP3CA 

Downstream 

RNF19 

■SsSBm 

LMNB1 

Overlapping 

PCDHGB1 

Downstream 

AKAP9 

Upstream 

AKR1A1 

■mm 

KLC4 

NA 

NA 

HSSSBjH 

ZNF197 

Upstream 

ABT1 

Downstream 

CDKN2C 

ACVR2B 

■m 

INSC 

!  Downstream 

NIPA2 

■SUB 

GCLM 

Downstream 

HIST2H3A 

Downstream 

UBE2D4 

Downstream 

CPS1 

NA 

NA 

Downstream 

AKAP13 

57575 


64975 


11093 


84321 


2729 


260425 


30968 


63934 


2395 


137695 


7776 


91050 


151648 


84245 


641384 


9811 


6556 


442092 


389763 


3899 


52 


10518 


6285 


389320 


53820 


54465 


8334 


7776 


55034 


124222 


51379 


29777 


7773 


5530 


25897 


4001 


56104 


10142 


10327 


89953 


10168 


29777 


1031 


93 


387755 


81614 


2730 


333932 


51619 


1373 


11214 


Best  E- 
Box 


NA 


NA 


NA 


NA 


NA 


NA 


1 


NA 


NA 


0.043 


NA 


NA 


NA 


0 


NA 


NA 


NA 


NA 


NA 


1 


NA 


NA 


NA 


NA 


0.597 


NA 


NA 


NA 


NA 


NA 


NA 


0.001 


NA 


0 


NA 


NA 


NA 


NA 


0.001 


NA 


0.135 


0 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


NA 


0.001 


NA 


NA 


NA 


Best  Hit 


Best  Location 


Location 


Length  x  Genel  Direction  Genel  Symbol 
3  Distance  1 


NA 

chr7:62451 81 3-624521 58 

136 

1208761 

Upstream 

ZNF680 

chr7:27421 39-27421 45 

chr7:2740920-2741293 

355 

55232 

AMZ1 

chr9:83397844-83397850 

chr9:81 523458-81 52371 7 

212 

146761 

Upstream 

TLE4 

chr1:1 9450900-1 9450906 

NA 

NA 

NA 

NA 

NA 

chrl  9:50373795-50373801 

NA 

NA 

NA 

NA 

NA 

chrl  2:564521 64-564521 70 

chrX:251 231 36-251 23270 

76 

179361 

Downstream 

ARX 

NA 

chr7:62451 81 3-624521 58 

136 

1208761 

Upstream 

ZNF680 

chrl  2:8271 408-8271 41 4 

chr4:3990600-3991 350 

604 

252507 

BwMiranf 

ADRA2C 

chrl  9:5741 041 -5741 047 

NA 

NA 

NA 

NA 

NA 

chrl  2:1 076631 0-1 076631 6 

NA 

NA 

NA 

NA 

NA 

chr9:21 47925-21 47931 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:49290248-49290336 

53 

0 

ZNF224 

chr5:1 1022370-1 1022376 

NA 

NA 

NA 

NA 

NA 

chrl  9:5741 843-5741 849 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:7050625-7050788 

156 

30155 

Upstream 

ZNF557 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:21 11 6591 -21 11 6939 

201 

0 

Overlapping 

ZNF431 

NA 

chrl  7:56501 999-56502038 

36 

330000 

Downstream 

TBX2 

NA 

chrl  3:40604532-40605075 

282 

0 

Overlapping 

KBTBD6 

NA 

chrl  :1 1 8273592-1 1 8274037 

446 

0 

GDAP2 

NA 

chr20:2461 2659-2461 2734 

76 

214825 

Upstream 

C20orf39 

NA 

chrl  77209567-77209688 

82 

103794 

■91BH 

ST6GALNAC5 

NA 

chr9:5328975-5329649 

404 

224 

Upstream 

RLN1 

NA 

chr9:420091 43-4200951 5 

345 

64112 

Downstream 

MGC21881 

chr4:1 09760420-1 09760426 

chr6:43403741  -43403841 

50 

19233 

Downstream 

CRIP3 

NA 

chrl  6:232261 1-2322698 

52 

8050 

ism 

ABCA3 

chr2:1 5649792-1 5649798 

NA 

NA 

NA 

NA 

NA 

chr5:1 1022370-1 1022376 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:2004805-2004876 

68 

465 

Downstream 

SMARCA2 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:221 72072-221 72078 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 34286935-1 34287591 

462 

18227 

fl.MICEHlIl 

PCBD2 

chrl  9:63023020-63023026 

chrl  9:631 19403-631 19709 

196 

81 

Upstream 

ZNF417 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:961 32093-961 321 85 

85 

0 

MANEA 

chr9:39807753-39807759 

chr9:39277958-39278790 

691 

0 

Overlapping 

CNTNAP3 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:749887-749946 

60 

1477 

Upstream 

PTBP1 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:302901 19-30290125 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  6:20725792-20726021 

222 

459 

Upstream 

LOC81691 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  4:1 8862857-1 8862929 

45 

108076 

WHS* 

LOC440157 

NA 

chr5:1 081 47697-1 081 47802 

51 

36276 

FER 

NA 

chr7:1 6651 637-1 6651 766 

126 

157 

1 1  in  i  nw 

ANKMY2 

NA 

chrl  77209567-77209688 

82 

103794 

■m 

ST6GALNAC5 

chrl  7:75426285-75426291 

chrl  7:75425774-75425980 

207 

1828 

■SSSBH 

CBX4 

chr5:1 1022370-1 1022376 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:1 1 3834303-1 1 3834378 

40 

56243 

iMBSSBl 

LARP7 

chrl  2:1 24039584-1 24039590 

NA 

NA 

NA 

NA 

NA 

chr6:1 08031 90-1 08031 96 

chr6:1 0803065-1 08031 22 

58 

51 

Downstream 

PAK1IP1 

Best  Location 


Location 


Length  x  Genel  Direction  Genel  Symbol 
3  Distance  J 


NA 

chrl  9:21 303463-21 304001 

302 

51 

Upstream 

ZNF708 

NA 

chrl  :556638-55771 8 

980 

54179 

■iHSPi 

OR4F3 

chr2:1 220051 39-1 220051 45 

chr2:1 22003874-1 220041 20 

240 

119402 

Upstream 

CLASP1 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:37407266-37407305 

40 

312 

Downstream 

NUP155 

chrl  8:58532681 -58532687 

NA 

NA 

NA 

NA 

NA 

chrl  2:31 11 7703-31 11 7709 

chrl  2:9491 697-9492229 

465 

159535 

Upstream 

KLRB1 

NA 

chrl  :27565075-27565220 

130 

704 

MAP3K6 

chr4:1 401 56057-1 401 56063 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:90586202-90586208 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  5:61 237072-61 2371 32 

61 

278 

Downstream 

RPS27L 

chr7:64872879-64872885 

chr7:641 78589-641 79449 

746 

89740 

Downstream 

H-plk 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :31 998486-31 998769 

284 

3466 

Upstream 

BAI2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 34289001  -1 34289707 

526 

20293 

Upstream 

PCBD2 

chr5:7932221 8-79322224 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:49222671  -49222844 

146 

1316 

Upstream 

ZNF222 

NA 

chr9:99040526-99040788 

165 

0 

KIAA1529 

chr7:1 04441 533-104441 539 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:420091 43-42009506 

340 

64112 

Downstream 

MGC21881 

NA 

chr6:58256529-58256963 

387 

138692 

GUSBL2 

chr5:1 45806777-1 45806783 

NA 

NA 

NA 

NA  1 

NA 

NA 

chrl  2:48421 533-48421 869 

337 

10642 

Downstream 

TEGT 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:90586202-90586208 

NA 

NA 

NA 

NA 

NA 

chrl  2:5296031 2-5296031 8 

NA 

NA 

NA 

NA 

NA 

chrl  9:50620785-50620791 

chrl  9:5061 9274-5061 931 4 

41 

257 

Downstream 

ERCC1 

NA 

chrl  :321 68543-321 68581 

35 

7994 

Upstream 

PTP4A2 

chrl  6:851 46022-851 46028 

chrl  6:851 46282-851 46478 

197 

0 

MTHFSD 

ch  r6 : 1 44458684- 1 44458690 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:4250431 1-42504406 

60 

44529 

■HI 

GHR 

chrl  1:1 11 254805-1 11 254811 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :3654470-3654476 

chrl  :3653774-3653883 

110 

724 

Downstream 

KIAA0495 

NA 

chr9:420091 43-4200951 5 

345 

64112 

Downstream 

MGC21881 

chr5:1 40729991  -1 40729997 

NA 

NA 

NA 

NA 

NA 

chrl  9:1 3746443-1 3746449 

chrl  9:1 3746645-1 3746990 

321 

389 

flwMirailll 

C19orf53 

chrl  2:51 97531 4-51 975320 

chrl  :220881 673-220881 762 

70 

51795 

Downstream 

TAFIA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 34289000-1 34289705 

513 

20292 

PCBD2 

NA 

chr9:1 05007050-1 050071 20 

59 

209567 

Upstream 

CYLC2 

NA 

chr9:957481 04-957481 75 

51 

9248 

■91BH 

BARX1 

chr2:1 1301 6666-1 1301 6672 

chr2:1 13015806-1 13015884 

79 

199 

Downstream 

POLR1B 

chrl  7:781 54963-781 54969 

chr7:1 1 3020280-1 1 3020364 

41 

325936 

PPP1R3A 

chr5:1 25958757-1 25958763 

chr5:76621 765-76621 958 

158 

79304 

MMII.LIi.-U..— 1 

PDE8B 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558006-558507 

435 

53390 

OR4F3 

NA 

chr7:62451 81 3-62452008 

66 

1208911 

ZNF680 

chrl  8:55090066-55090072 

chrl  9:3722535-3722674 

60 

545 

RAX  LI 

chr4:841 7491 3-841 7491 9 

chrX:41 891 087-41 891 182 

52 

223875 

Downstream 

CASK 

chrl  2:1 1 9240006-1 1 924001 2 

NA 

NA 

NA 

NA 

NA 

Best  Location 


Location 


Length  x  Genel  Direction  Genel  Symbol 
3  Distance  J 


chr9:351 06373-351 06379 

chr9:351 05993-351 06202 

198 

100 

Downstream 

KIAA1539 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:10486010-10486180 

119 

33170 

Upstream 

RNF141 

NA 

chrl  :558000-558507 

476 

53390 

HHSH 

OR4F3 

NA 

chrl  :1 480991 91  -148099464 

266 

0 

Overlapping 

HIST2H4A 

chrl  9:1 2695765-1 2695771 

NA 

NA 

NA 

NA 

NA 

chr6:24829836-24829842 

NA 

NA 

NA 

NA 

NA 

chrl  6:81 828081 -81 828087 

NA 

NA 

NA 

NA 

NA 

chr7:451 17713-451 17719 

NA 

NA 

NA 

NA 

NA 

chr2:207338840-207338846 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:79982889-799831 33 

217 

2916 

Downstream 

MSH3 

chr7:1 43029527-1 43029533 

chr7:1 431 63227-1431 63544 

94 

66561 

Upstream 

KIAA0738 

chr6:1 22973093-1 22973099 

NA 

NA 

NA 

NA 

NA 

chr4:53926970-53926976 

NA 

NA 

NA 

NA 

NA 

chrl  1 :1 24241 626-124241 632 

NA 

NA 

NA 

NA 

NA 

chrl  3:46268725-46268731 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr2:1 28000303-1 28000376 

74 

138 

Upstream 

IWS1 

chrl  3:35769606-3576961 2 

NA 

NA 

NA 

NA 

NA 

chrl  2:1 1 9503341  -1 1 9503347 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  2:1 09028554-1 09028643 

62 

17921 

Downstream 

IFT81 

chr3:47299496-47299502 

chr3:47298259-47298536 

278 

506 

Upstream 

KIF9 

NA 

chr2:7381 7097-7381 71 66 

70 

855 

■91BH 

TPRKB 

chr6:43089920-43089926 

NA 

NA 

NA 

NA 

NA 

chrl  9:63023020-63023026 

chrl  9:631 19403-631 19709 

196 

81 

ZNF417 

chr9:94095981 -94095987 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  5:61 237072-61 2371 32 

61 

278 

Downstream 

RPS27L 

chrl  9:57334896-57334902 

chrl  9:573351 99-5733531 7 

119 

196 

Downstream 

ZNF616 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  77209578-77209688 

79 

103805 

Upstream 

ST6GALNAC5 

NA 

NA 

NA 

NA 

NA 

NA 

ch  r6 : 1 53346 1 36- 1 53346 1 42 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  7:21 944883-21 945047 

97 

195387 

awwiraiiii 

FAM27L 

chrl  2:94860961  -94860967 

chrl  2:94860887-94860997 

107 

204 

Downstream 

AMDHD1 

NA 

chrl  0:32257290-32257349 

60 

427 

ARHGAP12 

chr2:1 5649792-1 5649798 

NA 

NA 

NA 

NA 

NA 

chrl  :682881 76-682881 82 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr2:450091 07-450091 45 

39 

13395 

Downstream 

SIX3 

chrl  9:50601 730-50601 736 

NA 

NA 

NA 

NA 

NA 

chrl  6:81 828081 -81 828087 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:8563361 1-85633697 

87 

149 

Upstream 

EED 

NA 

chrl  7:38795701-38795885 

149 

35993 

Downstream 

ARL4D 

NA 

chr5:1 34286935-1 34287591 

462 

18227 

BwMirailll 

PCBD2 

chrl  5:67673631  -67673637 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:30471 88-3047231 

44 

983 

ffWMirailllj 

HD 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:50620370-50620427 

42 

1353 

Downstream 

ERCC1 

chr6:306471 19-306471 25 

chr6:30646685-30646831 

123 

317 

Downstream 

ABCF1 

NA 

chrl  1:10486010-10486178 

117 

33172 

RNF141 

chr22:45537300-45537306 

NA 

NA 

NA 

NA 

NA 

Best  Location 


Location 


Length  x  Genel  Direction  Genel  Symbol 
3  Distance  J 


NA 

chrl  7:21 944883-21 945047 

97 

195387 

Upstream 

FAM27L 

NA 

chr6:2791 441 9-2791 4708 

222 

1 

■sn 

HIST1H2BN 

NA 

chr5:1 34290508-1 3429091 3 

342 

21800 

Upstream 

PCBD2 

NA 

chrl  7:25952007-25952238 

220 

41298 

■!HSH 

DKFZP434O047 

NA 

chr3:9766593-9766825 

233 

0 

Overlapping 

OGGI 

NA 

chr5:1 31 854039-131 854258 

220 

68 

IRF1 

NA 

chrl  4:51 3971 83-51 397290 

108 

16769 

Downstream 

GNG2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558000-558507 

468 

53390 

Upstream 

OR4F3 

chrl  9:49338358-49338364 

chrl  9:49290600-49290694 

64 

264 

WBmH 

ZNF224 

chr5:89805563-89805569 

chrl  8:44258665-44258883 

127 

60541 

Downstream 

KIAA0427 

NA 

chrl  :1 6939839-1 6940263 

377 

127270 

Downstream 

NBPF1 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :23761 61 96-23761 6311 

116 

242684 

Downstream 

CHRM3 

NA 

NA 

NA 

NA 

NA 

NA 

chr2:238540562-238540568 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  6:1 4073702-1 4073809 

96 

1006 

Upstream 

MKL2 

NA 

chr6:961 32093-961 321 85 

85 

0 

MANEA 

NA 

chrl  :556651 -55771 6 

953 

54181 

Upstream 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:7789387-7789393 

chrl  5:22965642-22965934 

167 

214415 

Upstream 

SNRPN 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  8:31 178648-31 178654 

chrl  8:31 177573-31 17781 2 

240 

591 

Upstream 

ZNF24 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:1 59341 088-159341 094 

chr6:1 59343965-1 59344039 

39 

2779 

Downstream 

RSHL2 

NA 

chr5:1 34286943-1 34287591 

454 

18235 

PCBD2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  6:31 134763-31 134769 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:10486010-10486180 

119 

33170 

RNF141 

chr3:48647002-48647008 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  5:61 237072-61 2371 32 

61 

278 

Downstream 

RPS27L 

NA 

chrl  :558006-558507 

462 

53390 

Upstream 

OR4F3 

chrl  1 :59074846-59074852 

NA 

NA 

NA 

NA 

NA 

chrl  3:46268725-46268731 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:9296007-929601 3 

NA 

NA 

NA 

NA 

NA 

chrl  3:46268725-46268731 

NA 

NA 

NA 

NA 

NA 

NA 

chr7 :20784444-20784799 

344 

8231 

■Bn 

SP8 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  3:99431 429-99431 568 

140 

751 

Downstream 

ZIC2 

chrl  5:80342386-80342392 

chrl  5:8253941 8-82539905 

280 

217498 

Upstream 

KIAA1920 

chrl  9:6261 4755-6261 4761 

chrl  9:62638587-62638892 

126 

7697 

Downstream 

ZNF749 

NA 

chrl  7:21 948201 -21 948643 

143 

198705 

Upstream 

FAM27L 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:70256646-70257025 

376 

123 

Upstream 

SMN1 

chr4:76868646-76868652 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr22:45537300-45537306 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  8:73075966-73075972 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  3:26723690-26723856 

163 

0 

RPL21 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :3531 7056-3531 7207 

116 

351 

Downstream 

ZMYM1 

chrl  0:79459309-7945931 5 

chrl  0:79459401  -79459459 

59 

136 

Downstream 

POLR3A 

Best  Location 

Location 

Length 

Genel 

Distance 

Genel  Direction 

Genel  Symbol 

chr4:492091 90-492091 96 

chrl  :1 421 24430-1421 24689 

208 

950914 

Upstream 

PPIAL4 

NA 

chr4:79321 923-7932231 9 

357 

123804 

£lKj£U£EUlBi 

FRAS1 

chrl  0:1 22728535-1 22728541 

NA 

NA 

NA 

NA 

NA 

chrl  7:34983952-34983958 

chrl  7:34983307-34983397 

91 

34304 

IHHiH 

NEUROD2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:70256646-70257025 

372 

123 

SMN1 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:960424-960430 

chrl  1 :959953-960074 

102 

44113 

IHUsSBSH 

AP2A2 

NA 

chrl  :554327-554724 

386 

57173 

Upstream 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  6:21 45055-21 45061 

chrl  0:1 28458275-1 28458329 

35 

125683 

Downstream 

DOCK1 

chrl  :1 0381309-1 038131 5 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  3:40666278-40666822 

290 

0 

1 

KBTBD7 

chr8:1 32985062-1 32985068 

chr8:1 32985248-1 32985467 

216 

35862 

Downstream 

KIAA0143 

chrl  :1 60734241  -1 60734247 

NA 

NA 

NA 

NA 

NA 

chr9:1 1638981 2-1 1638981 8 

chrl  4:1 053231 46-1 05323287 

50 

259149 

Upstream 

TMEM121 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr22:25086936-25086942 

chr22:25086887-25087002 

116 

68299 

Downstream 

ASPHD2 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1  77025630-77025636 

chr4:1 70990265-1 70990503 

97 

74642 

Downstream 

C4orf27 

NA 

chr4:1 531 86090-1531 86280 

170 

284494 

Downstream 

PET112L 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  2:47532233-47532522 

290 

9 

Downstream 

DDX23 

NA 

chrl  :1 44656895-1 44657026 

68 

111953 

NBPF8 

NA 

chr6:58249909-58250236 

324 

145419 

Upstream 

GUSBL2 

NA 

chrl  :1 6939839-1 6940263 

377 

127270 

Downstream 

NBPF1 

NA 

chr3:1 1 6859547-1 1 6859779 

217 

34406 

Upstream 

GAP43 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:351 1771 9-351 18002 

273 

727 

Upstream 

CD44 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:21 11 6591 -21 11 6976 

228 

0 

ZNF431 

chr2:71210309-71210315 

chr2:71 21 0964-71 21 1552 

565 

13 

Upstream 

MPHOSPH10 

chr5:56241 865-56241 871 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  3:26723690-26723856 

163 

0 

Overlapping 

RPL21 

chr8:1 2881 6091  -1 2881 6097 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  7:21 944883-21 945032 

90 

195387 

ihsssbsh 

FAM27L 

chr22:25086936-25086942 

chr22:25086887-25087002 

116 

68299 

Downstream 

ASPHD2 

NA 

chrl  :5561 16-55771 6 

1484 

54181 

IHSsSBjH 

OR4F3 

chrl  2:551 48567-551 48573 

NA 

NA 

NA 

NA 

NA 

chr2:1 321 46992-1 321 46998 

chrl  4:1 871 1904-18712927 

808 

41854 

Downstream 

LOC440157 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:1 2881 6091  -1 2881 6097 

NA 

NA 

NA 

NA 

NA 

NA 

chr3:1 71 854552-1 71 854661 

70 

68000 

Downstream 

SLC7A14 

NA 

NA 

NA 

NA 

NA 

NA 

chr3:25798739-25798745 

NA 

NA 

NA 

NA 

NA 

NA 

chr3:1 97850925-1 97851 375 

443 

0 

1 

LRRC33 

NA 

chrl  :1 6939839-1 6940263 

377 

127270 

Downstream 

NBPF1 

NA 

NA 

NA 

NA 

NA 

NA 

Best  Location 

Location 

Length 

Genel 

Distance 

Genel  Direction 

Genel  Symbol 

NA 

chr2:32280092-32280255 

116 

35656 

Upstream 

SLC30A6 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr2:1 87059526-1 87059532 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  6:30794542-30794548 

chrl  6:30794200-30794254 

55 

18646 

HSsSBSH 

BCL7C 

NA 

chrl  :558000-558507 

456 

53390 

Upstream 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  0:1 03532601  -1 03532607 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :1 45999056-1 45999062 

chrl  :1 44702006-1 44702550 

537 

66429 

HSsSBK 

NBPF8 

NA 

chrl  2:52432046-524321 68 

111 

24560 

Downstream 

CALCOCOI 

NA 

chrl  :558000-558500 

465 

53397 

MSsSBH 

OR4F3 

chr2:1 88864671  -1 88864677 

chr2:1 88864724-1 88864924 

201 

710 

Downstream 

GULP1 

NA 

chrl  7:1 895631 5-1 8956457 

131 

65254 

Downstream 

GRAP 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  4:1 04706859-1 04706930 

40 

653 

Downstream 

JAG2 

NA 

chrl  :1 1676241 0-1 16762575 

166 

45052 

Upstream 

ATP1A1 

chrl  1:1 11 254805-1 11 254811 

NA 

NA 

NA 

NA 

NA 

chrl  9:42021 172-42021 178 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:1 21 6971 00-121 697222 

123 

121 

C6orf170 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  0:30695507-30695853 

163 

17253 

Downstream 

PAPD1 

NA 

chrl  :558000-558507 

468 

53390 

Upstream 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:1 7032825-1 7032831 

chr9:1 7032364-1 7032500 

133 

92537 

Downstream 

C9orf39 

NA 

chr5:1 34286935-1 34287591 

465 

18227 

HSsSBSH 

PCBD2 

NA 

chrl  9:21 1 1 6588-21 117166 

272 

0 

Overlapping 

ZNF431 

NA 

chrl  9:6071 9936-60720068 

133 

1776 

Downstream 

FLJ35258 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:98725543-98725549 

chr8:98725301 -98725388 

88 

194 

Downstream 

MTDH 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:1 84001 108-184001 114 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  7:4201 1770-4201 2558 

769 

0 

Overlapping 

ARL17P1 

NA 

chrl  1:65412670-65413016 

264 

84 

Downstream 

FIBP 

chrl  2:1 11 860995-1 11 861 001 

chr22:1 6640527-1 6640826 

288 

3269 

Downstream 

BID 

chr3:1 38968943-138968949 

NA 

NA 

NA 

NA 

NA 

chr2:1 37238564-1 37238570 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:3024751 5-30247521 

NA 

NA 

NA 

NA 

NA 

chrl  4:38971 494-38971 500 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:67316659-67317125 

403 

111398 

Downstream 

ALDH3B2 

NA 

chrl  :1 4402931 9-144030296 

951 

94338 

Downstream 

HFE2 

NA 

chr8:22475009-22475058 

46 

4101 

Downstream 

SORBS3 

NA 

chrl  :1 4246421 9-14246451 9 

142 

611084 

PPIAL4 

NA 

chrl  :558000-558507 

468 

53390 

OR4F3 

NA 

chrl  :1 3553453-1 3553544 

92 

38894 

HBBI 

PRAMEF9 

NA 

chr3:75728395-75728440 

42 

1 075362 

Downstream 

CNTN3 

NA 

chrl  9:6301 7577-6301 7923 

142 

170 

umsm 

ZNF552 

chr9:351 06373-351 06379 

chr9:351 05993-351 06202 

198 

100 

Downstream 

KIAA1539 

NA 

chrl  :558000-558507 

476 

53390 

HHiSH 

OR4F3 

chr7:1 4841 8543-14841 8549 

chr7:1 4841 9322-14841 9608 

287 

602 

Downstream 

ZNF786 

NA 

chrl  :556656-55771 6 

900 

54181 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

Best  Location 

Location 

Length 

Genel 

Distance 

Genel  Direction 

Genel  Symbol 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:1 25604379-1 25604459 

57 

246923 

BUB 

ANKRD50 

NA 

NA 

NA 

NA 

NA 

NA 

chr7:1 51 203334-1 51 203340 

NA 

NA 

NA 

NA 

NA 

chr7:451 1771 3-451 1771 9 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:21 11 6627-21 11 7208 

142 

0 

ZNF431 

NA 

chr5:1 34286946-1 34287591 

384 

18238 

Upstream 

PCBD2 

NA 

chrl  :558000-558507 

456 

53390 

OR4F3 

NA 

chrl  :39752855-39753648 

771 

22951 

Upstream 

BMP8A 

chr9:33463904-3346391 0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558006-558507 

470 

53390 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :5581 30-558507 

338 

53390 

■sn 

OR4F3 

chrl  5:301 09929-301 09935 

chrl  5:301 1031 8-301 10388 

71 

301 

Upstream 

CHRNA7 

NA 

chrl  7:341 57430-341 57722 

293 

362 

SHBUuKi 

PCGF2 

NA 

chrl  2:31 91 339-31 91 378 

36 

134522 

Upstream 

TSPAN9 

chrl  1:1 8699664-1 8699670 

NA 

NA 

NA 

NA 

NA 

chr7:74909969-74909975 

chr7:76094085-76094426 

338 

130 

Upstream 

POMZP3 

NA 

chrl  :558006-558507 

470 

53390 

OR4F3 

chrl  1 :60565869-60565875 

NA 

NA 

NA 

NA 

NA 

chrl  1 :32406378-32406384 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  7:60203895-60204054 

128 

115047 

Downstream 

SMURF2 

chrl  8:321 1561 9-321 15625 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :1 593901 94-159390634 

433 

22 

Upstream 

UFC1 

NA 

chr2:1 28332289-1 28332495 

203 

115 

Downstream 

POLR2D 

chrl  7:76481 158-76481 164 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558347-558507 

153 

53390 

mm 

OR4F3 

chrl  3:781 31 204-781 31 210 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 72589588-1 72589792 

201 

5076 

Upstream 

NKX2-5 

chr8:1 1 0726265-1 1 0726271 

chr8:1 10725074-1 107251 18 

45 

0 

FLJ20366 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:31 973479-31 973485 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558000-558507 

476 

53390 

Upstream 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 34286935-1 34287597 

463 

18227 

Upstream 

PCBD2 

chrl  7:1 750801 7-1 7508023 

chrl  7:1 750721 6-1 7507273 

58 

18238 

Downstream 

RAM 

chrl  7:568281 36-568281 42 

NA 

NA 

NA 

NA 

NA 

chrl  7:595771 99-59577205 

chr4:1 85773446-1 85773520 

55 

34103 

BHUm 

CASP3 

NA 

chrl  :554327-554724 

386 

57173 

Upstream 

OR4F3 

NA 

chr5:1 34286935-1 34287591 

469 

18227 

SHBUuHl 

PCBD2 

chr3:2525351 5-25253521 

NA 

NA 

NA 

NA 

NA 

chrl  8:50049746-50049752 

NA 

NA 

NA 

NA 

NA 

NA 

chr2:1 76842851  -1 76843034 

133 

469 

Upstream 

MTX2 

NA 

chrl  75375433-75375899 

463 

8727 

LHX8 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558000-558507 

468 

53390 

OR4F3 

chrl  1  7965392-7965398 

chrl  1  7965622-7965856 

231 

180 

flMliwu.fr  ml 

EIF3S5 

chr8:1 01 928959-1 01 928965 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr2:1 32742653-1 32742783 

86 

148302 

Downstream 

GPR39 

chrl  9:998061 0-998061 6 

chrl  9:9980788-9980827 

40 

1320 

mm 

COL5A3 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr7:2722691 2-27227028 

117 

20662 

Downstream 

HOXA13 

Best  Location 

Location 

Length 

Genel 

Distance 

Genel  Direction 

Genel  Symbol 

chrl  1 :1 24241 626-124241 632 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  77209567-77209688 

82 

103794 

HMHB 

ST6GALNAC5 

NA 

chrl  :35358967-35359457 

274 

41409 

Upstream 

ZMYM1 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:79982889-799831 33 

217 

2916 

Downstream 

MSH3 

NA 

chrl  :558000-558507 

468 

53390 

Upstream 

OR4F3 

NA 

chrl  :558000-558507 

476 

53390 

■!HK 

OR4F3 

NA 

chrl  2:9327921 -9328232 

264 

75688 

Downstream 

PZP 

chrl  7:75392578-75392584 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr20:43973074-439731 39 

62 

1054 

Upstream 

PLTP 

NA 

chrl  :558000-558505 

474 

53392 

OR4F3 

chr2:85050802-85050808 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :1 45866984-1 45868073 

1046 

0 

GPR89A 

chrl  :255361 97-25536203 

chrl  :25465796-25466286 

475 

5281 

Downstream 

RHD 

NA 

chr8:57285797-57285879 

83 

513 

HSsSBSHI 

PLAG1 

chr5:1 72647030-1 72647036 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  7:1 41 41 51 4-1 41 41 603 

70 

3627 

Downstream 

HS3ST3B1 

NA 

NA 

NA 

NA 

NA 

NA 

chr3:1 2500700-1 2500706 

NA 

NA 

NA 

NA 

NA 

chrl  1:731 68008-731 6801 4 

NA 

NA 

NA 

NA 

NA 

chr3:27731 747-27731 753 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  0:41 981 232-41 981 272 

37 

472726 

Upstream 

ZNF33B 

chr6:41 997243-41 997249 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:1 20440701  -1 20440802 

98 

433 

Upstream 

LOC401 152 

NA 

chrl  :558000-558507 

476 

53390 

■9S&SE. 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:1 04496740-1 04496746 

chrl  5:490241 19-4902421 9 

65 

35882 

mm 

AP4E1 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:1 44458684-1 44458690 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :556840-55771 9 

785 

54178 

Upstream 

OR4F3 

NA 

chrl  9:51 61 1 826-5161 1 949 

99 

3145 

Downstream 

CCDC8 

chrl  9:4059570-4059576 

chrl  9:4059678-405981 8 

141 

15308 

Upstream 

MAP2K2 

chr7:99536976-99536982 

NA 

NA 

NA 

NA 

NA 

chrl  :1 1718715-1 1718721 

chrl  :1 171 8450-1 171 8755 

302 

0 

Overlapping 

AGTRAP 

chr2:1 5649792-1 5649798 

chrl  6:287431 36-2874341 5 

167 

1222 

ATXN2L 

NA 

chrl  :1 4246421 9-14246451 9 

142 

611084 

Upstream 

PPIAL4 

NA 

chr2:8551 8339-8551 8451 

113 

2911 

■9S&SH 

SH2D6 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 48704707-1 4870471 3 

chr5:1 48704700-1 48704766 

67 

403 

Downstream 

GRPEL2 

NA 

chr5:1 34289000-1 34289705 

513 

20292 

uissnm 

PCBD2 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:1 28773840-1 28773846 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1 :44289643-44289734 

84 

1351 

Downstream 

ALX4 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:5821 8334-5821 8478 

145 

149377 

Downstream 

IMPAD1 

NA 

chrl  :558000-558507 

476 

53390 

■HBB 

OR4F3 

NA 

chr3:1 59305720-1 59305923 

200 

707 

■!HiH 

SHOX2 

chr4:41 632661 -41 632667 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  2:75055075-750551 74 

72 

211179 

BBS10 

NA 

chr5:1 32229644-1 32229822 

179 

395 

Downstream 

UQCRQ 

chr4:41 632661 -41 632667 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:49290248-49290336 

53 

0 

ZNF224 

Best  Location 

Location 

Length 

Genel 

Distance 

Genel  Direction 

Genel  Symbol 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  5:29533409-29533753 

288 

127035 

Upstream 

KLF13 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:53331 361 -53331 573 

213 

9460 

Downstream 

ELOVL5 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:351 06373-351 06379 

chr9:351 05993-351 06202 

198 

100 

Downstream 

KIAA1539 

NA 

chrl  9:61 71 1 435-6171 1 643 

192 

412 

HBsSBSH 

ZNF471 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:5691 9663-5691 9669 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  9:1 3746443-1 3746449 

chrl  9:1 3746645-1 3746982 

334 

389 

C19orf53 

NA 

chrl  :401 39584-401 39626 

35 

648 

Upstream 

MYCL1 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558007-558507 

469 

53390 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:83397844-83397850 

chr9:81 523458-81 52371 7 

212 

146761 

MSssnro 

TLE4 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:10486010-10486181 

120 

33169 

HSSSBSHI 

RNF141 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  5:721 79475-721 79803 

227 

17531 

Downstream 

GOLGA6 

chr21 :46905251  -46905257 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:10486010-10486180 

119 

33170 

Upstream 

RNF141 

NA 

chr21 :37275040-372751 48 

109 

9225 

■9S&SH 

HLCS 

NA 

chr2:66656086-66656231 

146 

140051 

Upstream 

MEIS1 

NA 

chr6:26266579-26266757 

155 

252 

HIST1H2BD 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  8:321 1561 9-321 15625 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558000-558507 

476 

53390 

Upstream 

OR4F3 

chrl  6:2953823-2953829 

chrl  6:295391 7-2954095 

179 

122 

Downstream 

KREMEN2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr6:268881 28-26888555 

428 

120387 

Downstream 

ZNF322A 

NA 

chrl  9:49222671  -49222844 

146 

1316 

Upstream 

ZNF222 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:1 01 392063-1 01 392069 

chr8:1 01 391 276-1 01391501 

226 

2 

Upstream 

RNF19 

NA 

NA 

NA 

NA 

NA 

NA 

chr5:1 4071 0307-14071 0313 

NA 

NA 

NA 

NA 

NA 

chr7:91 348260-91 348266 

NA 

NA 

NA 

NA 

NA 

NA 

chr8:6201 4926-6201 5084 

115 

261034 

Upstream 

CHD7 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:10486010-10486180 

119 

33170 

Upstream 

RNF141 

NA 

chr5:4250431 1-42504406 

60 

44529 

ttwMiraillWSj 

GHR 

NA 

chr6:268881 28-26888562 

431 

120387 

Downstream 

ZNF322A 

NA 

chrl  :51 206925-51 206968 

44 

0 

CDKN2C 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :1 4808991 2-148090270 

347 

534 

Downstream 

HIST2H3C 

chr7:439321 22-439321 28 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  :558000-558507 

476 

53390 

OR4F3 

NA 

NA 

NA 

NA 

NA 

NA 

Best  Location 

Location 

Length 

Genel 

Distance 

Genel  Direction 

Genel  Symbol 

NA 

chrl  :1 480991 91  -1 48099464 

262 

0 

Overlapping 

HIST2H4A 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  2:551 67696-551 67702 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:8875251 9-88752648 

130 

595 

Downstream 

GAS1 

chr8:65657029-65657035 

chr20:61 108007-61 1081 87 

133 

600 

BHLHB4 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr20:43973066-439731 38 

73 

1055 

PLTP 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chrl  1:10486010-10486180 

119 

33170 

RNF141 

NA 

chrl  :1 47481 998-147482346 

176 

72631 

Downstream 

LOC388692 

chr4:1 09760420-1 09760426 

chr6:43403741  -43403841 

50 

19233 

Downstream 

CRIP3 

NA 

chrl  5:41 768914-41 769414 

501 

2961 

Downstream 

CKMT1A 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr9:349891 41  -34989291 

72 

9357 

Upstream 

DNAJB5 

chr5:879931 45-879931 51 

chr5:87992407-87992576 

159 

222204 

■!HH 

MEF2C 

chrl  :1 73259496-1 73259502 

NA 

NA 

NA 

NA 

NA 

chr3:47842646-47842652 

chr3:47841 127-47841433 

299 

46 

Downstream 

DHX30 

NA 

chrl  0:1 24899392-1 24899430 

39 

1765 

Upstream 

HMX2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

chr4:79321 930-7932231 9 

326 

123811 

Upstream 

FRAS1 

NA 

NA 

NA 

NA 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

340252 

1311787 

Downstream 

ZNF588 

51427 

1 

0 

chr7:62451 399-62451 405 

155185 

1 02792 

awMirauii 

TTYH3 

80727 

4 

0.001 

chr7 :2740980-2740986 

7091 

1421580 

Upstream 

PS  ATI 

29968 

3 

1 

chr9:81 52361 7-81 523623 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

1 70302 

501160 

POLA1 

5422 

0 

NA 

NA 

340252 

1311787 

Downstream 

ZNF588 

51427 

1 

0 

chr7:62451 399-62451 405 

152 

288172 

■mssBi 

OTOP1 

133060 

1 

0 

chr4:3991 009-3991 01 5 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7767 

19051 

Downstream 

ZNF225 

7768 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

79230 

50276 

Upstream 

MBD3L2 

125997 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

1 70959 

59743 

Upstream 

ZNF714 

148206 

0 

NA 

NA 

6909 

341855 

Downstream 

Cl  7orf82 

388407 

1 

0.008 

chrl  7:56501 577-56501 583 

89890 

61627 

Upstream 

KBTBD7 

84078 

0 

NA 

NA 

54834 

0 

WDR3 

10885 

3 

0.001 

chrl  :1 1 8274393-1 1 8274399 

79953 

265131 

Downstream 

CST7 

8530 

0 

NA 

NA 

81849 

248032 

■HH 

PIGK 

10026 

0 

NA 

NA 

6013 

34395 

Downstream 

RLN2 

6019 

0 

NA 

NA 

389741 

348783 

Downstream 

ANKRD20A1 

84210 

0 

NA 

NA 

401262 

29766 

Upstream 

SLC22A7 

10864 

0 

NA 

NA 

21 

64496 

Downstream 

RNPS1 

10921 

1 

0.005 

chrl  6:2322776-2322782 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6595 

606916 

Downstream 

VLDLR 

7436 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

84105 

43921 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

147687 

18843 

Upstream 

ZNF418 

147686 

2 

0 

chrl  9:631 19804-631 19810 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

79694 

438404 

Downstream 

FUT9 

10690 

0 

NA 

NA 

79937 

66908 

Downstream 

LOC642265 

642265 

1 

0 

chr9:39278506-3927851 2 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

5725 

23006 

Upstream 

PRG2 

79948 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

81691 

496 

Downstream 

EXOD1 

112479 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

440157 

239493 

■mssH 

POTE14 

404785 

1 

0 

chrl  4:1 8862824-1 8862830 

2241 

402687 

Downstream 

FBXL17 

64839 

0 

NA 

NA 

57037 

517 

Downstream 

BZW2 

28969 

0 

NA 

NA 

81849 

248032 

PIGK 

10026 

0 

NA 

NA 

8535 

40289 

Downstream 

CBX8 

57332 

1 

1 

chrl  7:75426285-75426291 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51574 

56735 

■9SH 

LARP7 

51574 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

55003 

28201 

Downstream 

TMEM14C 

51522 

1 

0.012 

chr6:1 08031 90-1 08031 96 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

7562 

67769 

Downstream 

ZNF493 

284443 

0 

NA 

NA 

26683 

54179 

jiwMirainM 

OR4F16 

81399 

0 

NA 

NA 

23332 

206818 

Upstream 

MKI67IP 

84365 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

9631 

622 

Downstream 

NUP155 

9631 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

3820 

221346 

Downstream 

CLEC2D 

29121 

0 

NA 

NA 

9064 

8682 

FCN3 

8547 

0 

NA 

NA 

NA 

NA 

1  NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51065 

31648 

Downstream 

RAB8B 

51762 

0 

NA 

NA 

51351 

98757 

Downstream 

ZNF117 

7670 

1 

0 

chr7:641 78750-641 78756 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

576 

55398 

Upstream 

SPOCD1 

90853 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

84105 

41805 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7673 

23755 

Upstream 

ZNF230 

7773 

0 

NA 

NA 

57653 

38795 

Downstream 

ZNF322B 

387328 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

389741 

348792 

Downstream 

ANKRD20A1 

84210 

0 

NA 

NA 

375513 

138692 

GUSBL2 

375513 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7009 

34271 

Downstream 

FMNL3 

91010 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

2067 

632 

Downstream 

ERCC1 

2067 

0 

NA 

NA 

8073 

83496 

Downstream 

KHDRBS1 

10657 

0 

NA 

NA 

64779 

11964 

Downstream 

FOXC2 

2303 

3 

0 

chrl  6:851 46022-851 46028 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

2690 

343375 

SEPP1 

6414 

1 

0 

chr5:42504066-42504072 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

57212 

4938 

Downstream 

CCDC27 

148870 

0 

NA 

NA 

389741 

348783 

Downstream 

ANKRD20A1 

84210 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

28974 

10309 

■mn 

MGC3207 

84245 

1 

0 

chrl  9:1 3746443-1 3746449 

9015 

70677 

Upstream 

C1orf80 

64853 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

84105 

41807 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

1539 

889241 

Downstream 

SMC2 

10592 

0 

NA 

NA 

56033 

84721 

Downstream 

PTPDC1 

138639 

0 

NA 

NA 

84172 

42622 

Downstream 

CHCHD5 

84269 

0 

NA 

NA 

5506 

505265 

Downstream 

GPR85 

54329 

0 

NA 

NA 

8622 

202130 

IHESISKI 

WDR41 

55255 

2 

0.539 

chr5:76621 71 7-76621 723 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

a»n.Miranii 

OR4F16 

81399 

0 

NA 

NA 

340252 

1311937 

Downstream 

ZNF588 

51427 

1 

0 

chr7:62451 399-62451 405 

84839 

8864 

MRPL54 

116541 

1 

0.999 

chrl  9:3722546-3722552 

8573 

422736 

■ISISSH 

GPR82 

27197 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

80256 

12839 

Downstream 

STOML2 

30968 

1 

1 

chr9:351 06373-351 06379 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

50862 

51954 

Upstream 

AMPD3 

272 

0 

NA 

NA 

26683 

53390 

liwjsumiiKI 

OR4F16 

81399 

0 

NA 

NA 

8370 

0 

Overlapping 

H4/o 

554313 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

4437 

3423 

DHFR 

1719 

1 

1 

chr5:79982702-79982708 

9747 

99714 

Downstream 

OR2F2 

135948 

1 

0.001 

chr7:1 431 63369-1 431 63375 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

55677 

107817 

Upstream 

PROC 

5624 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

28981 

18043 

Downstream 

IFT81 

28981 

0 

NA 

NA 

64147 

508 

Upstream 

KIF9 

64147 

0 

NA 

NA 

51002 

35122 

Downstream 

NAT8B 

51471 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

147687 

18843 

■mssH 

ZNF418 

147686 

2 

0 

chrl  9:631 19804-631 19810 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51065 

31648 

Downstream 

RAB8B 

51762 

0 

NA 

NA 

90317 

49728 

Downstream 

PPP2R1A 

5518 

1 

0 

chrl  9:57334896-57334902 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

81849 

248032 

Upstream 

PIGK 

10026 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

284123 

685682 

KCNJ12 

3768 

1 

0 

chrl  7:21 944926-21 944932 

144193 

328 

Downstream 

CCDC38 

120935 

1 

0 

chrl  2:94860961  -94860967 

94134 

127872 

sm 

KIF5B 

3799 

2 

0.837 

chrl  0:32257803-32257809 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6496 

80881 

SIX2 

10736 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

8726 

57238 

Downstream 

C11orf73 

51501 

0 

NA 

NA 

379 

76282 

TMEM106A 

113277 

0 

NA 

NA 

84105 

43921 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

3064 

112048 

GRK4 

2868 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

2067 

1728 

Downstream 

ERCC1 

2067 

1 

0.003 

chrl  9:50620785-50620791 

23 

13698 

Downstream 

GNL1 

2794 

1 

0 

chr6:306471 19-306471 25 

50862 

51954 

■SSI 

AMPD3 

272 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

284123 

685682 

Upstream 

KCNJ12 

3768 

1 

0 

chrl  7:21 944926-21 944932 

8341 

323 

Downstream 

HIST1H2AK 

8330 

1 

0.999 

chr6:2791 4001 -2791 4007 

84105 

40599 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

26083 

123456 

GOSR1 

9527 

0 

NA 

NA 

4968 

0 

Overlapping 

OGGI 

4968 

0 

NA 

NA 

3659 

52855 

IL5 

3567 

0 

NA 

NA 

54331 

128652 

Downstream 

C14orf166 

51637 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

7767 

18693 

Downstream 

ZNF225 

7768 

0 

NA 

NA 

9811 

437173 

Downstream 

ZBTB7C 

201501 

0 

NA 

NA 

55672 

129763 

Downstream 

LOC440570 

440570 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

1131 

705292 

Downstream 

FMN2 

56776 

1 

0 

chrl  :23761 5922-23761 5928 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

57496 

152179 

Upstream 

ERCC4 

2072 

0 

NA 

NA 

79694 

438404 

Downstream 

FUT9 

10690 

0 

NA 

NA 

26683 

54181 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6638 

214415 

Upstream 

SNURF 

8926 

2 

0.418 

chrl  5:22965758-22965764 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7572 

33487 

Upstream 

ZNF396 

252884 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

83861 

42133 

Upstream 

TAGAP 

1 1 7289 

0 

NA 

NA 

84105 

43921 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

50862 

51954 

iKiiim 

AMPD3 

272 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51065 

31648 

Downstream 

RAB8B 

51762 

0 

NA 

NA 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

221833 

130875 

ABCB5 

340273 

0 

NA 

NA 

NA 

NA 

1  NA 

NA 

NA 

NA 

NA 

NA 

7546 

9250 

Downstream 

ZIC5 

85416 

0 

NA 

NA 

114817 

375125 

Upstream 

FLJ43276 

388165 

3 

0.041 

chrl  5:82540386-82540392 

388567 

20774 

VN1R1 

57191 

0 

NA 

NA 

284123 

689000 

Upstream 

KCNJ12 

3768 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6606 

123 

Upstream 

SMN2 

6607 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6144 

18607 

Downstream 

RASL11A 

387496 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

79830 

46900 

Downstream 

ZMYM6 

9204 

0 

NA 

NA 

11128 

4120 

Downstream 

RPS24 

6229 

1 

0.266 

chrl  0:79459309-7945931 5 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

1 64022 

1201626 

Downstream 

NBPF20 

400818 

0 

NA 

NA 

80144 

318999 

MRPL1 

65008 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

4761 

53307 

Downstream 

PPP1R1B 

84152 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6606 

123 

jWl.Miraillf 

SMN2 

6607 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

161 

59130 

Downstream 

CHID1 

66005 

1 

0 

chrl  1 :960424-960430 

26683 

57173 

Upstream 

OR4F16 

81399 

1 

0.082 

chrl  :553920-553926 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

1793 

258275 

Downstream 

C10orf90 

118611 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

84078 

61342 

Downstream 

KBTBD6 

89890 

0 

NA 

NA 

23167 

576719 

Upstream 

KCNQ3 

3786 

1 

0.006 

chr8:1 32985062-1 32985068 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

80757 

294398 

Upstream 

C14orf80 

283643 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

57168 

118549 

HPS4 

89781 

1 

0.001 

chr22:25086936-25086942 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

54969 

170990 

MFAP3L 

9848 

0 

NA 

NA 

5188 

307148 

Upstream 

FBXW7 

55294 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

9416 

13398 

Upstream 

RND1 

27289 

0 

NA 

NA 

641559 

137075 

■wMiraillf 

NBPF11 

200030 

1 

0 

chrl  :1 44656682-1 44656688 

375513 

145419 

Upstream 

GUSBL2 

375513 

0 

NA 

NA 

55672 

129763 

Downstream 

LOC440570 

440570 

0 

NA 

NA 

2596 

510730 

Downstream 

ZBTB20 

26137 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

960 

222979 

Upstream 

PDHX 

8050 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

170959 

59743 

ZNF714 

148206 

0 

NA 

NA 

10199 

97 

Downstream 

MCEE 

84693 

1 

0 

chr2:7121 1434-7121 1440 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

6144 

18607 

Downstream 

RASL11A 

387496 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

284123 

685682 

■mssii 

KCNJ12 

3768 

1 

0 

chrl  7:21 944926-21 944932 

57168 

118549 

Upstream 

HPS4 

89781 

1 

0.001 

chr22:25086936-25086942 

26683 

54181 

BSiBH 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

440157 

88540 

MwMirailll: 

POTE14 

404785 

6 

0 

chrl  4:1 871 1628-1 871 1634 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

57709 

235194 

CLDN11 

5010 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

375387 

71115 

Downstream 

WDR53 

348793 

1 

0.021 

chr3:1 97851 563-1 97851 569 

55672 

129763 

Downstream 

LOC440570 

440570 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

55676 

64050 

Upstream 

CARD12 

58484 

1 

0.004 

chr2:32279825-32279831 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

9274 

21174 

Downstream 

CTF1 

1489 

1 

0.266 

chrl  6:30794542-30794548 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

641559 

91551 

NBPF11 

200030 

1 

0 

chrl :  144702045- 144702051 

57658 

75267 

Downstream 

ATP5G2 

517 

0 

NA 

NA 

26683 

53397 

OR4F16 

81399 

0 

NA 

NA 

51454 

498152 

Upstream 

DIRC1 

1 1 6093 

1 

0 

chr2:1 88864671  -1 88864677 

10750 

1 07530 

Downstream 

FAM83G 

644815 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

3714 

11755 

BwMiraniM 

NUDT14 

256281 

0 

NA 

NA 

476 

152551 

Upstream 

CD58 

965 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

221322 

101264 

Downstream 

GJA1 

2697 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

55149 

67018 

Downstream 

MAP3K8 

1326 

0 

NA 

NA 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

54875 

171578 

Downstream 

BNC2 

54796 

1 

0 

chr9:1 7032825-1 7032831 

84105 

43921 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

1 70959 

59740 

Upstream 

ZNF714 

148206 

0 

NA 

NA 

284297 

31531 

NAT  14 

57106 

0 

NA 

NA 

NA 

NA 

1  NA 

NA 

NA 

NA 

NA 

NA 

92140 

131596 

Downstream 

LAPTM4B 

55353 

2 

0.077 

chr8:98725543-98725549 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51326 

0 

Overlapping 

LOC641 522 

641522 

0 

NA 

NA 

9158 

1434 

Downstream 

CCDC85B 

11007 

0 

NA 

NA 

637 

3748 

Downstream 

BID 

637 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

222 

117981 

Downstream 

ALDH3B2 

222 

0 

NA 

NA 

148738 

108849 

NOTCH2NL 

388677 

0 

NA 

NA 

10174 

9814 

Upstream 

SORBS3 

10174 

0 

NA 

NA 

1 64022 

861796 

Downstream 

NBPF20 

400818 

0 

NA 

NA 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

343070 

69316 

Downstream 

PRAMEF8 

391002 

0 

NA 

NA 

5067 

1443543 

Downstream 

ROB02 

6092 

0 

NA 

NA 

79818 

35157 

Downstream 

ZNF587 

84914 

1 

0 

chrl  9:6301 7929-6301 7935 

80256 

12839 

Downstream 

STOML2 

30968 

1 

1 

chr9:351 06373-351 06379 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

136051 

34703 

■mi 

ZNF425 

155054 

0 

NA 

NA 

26683 

54181 

■SSI 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

57182 

852557 

Downstream 

FAT4 

79633 

1 

1 

chr4:1 2560401 2-1 2560401 8 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

1 70959 

59779 

ZNF714 

148206 

0 

NA 

NA 

84105 

43921 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

26683 

53390 

39HSI 

OR4F16 

81399 

0 

NA 

NA 

353500 

61355 

Upstream 

PABPC4 

8761 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

asms 

OR4F16 

81399 

0 

NA 

NA 

1139 

375484 

Downstream 

OTUD7A 

161725 

1 

0.041 

chrl  5:301 09929-301 09935 

7703 

4805 

Downstream 

PSMB3 

5691 

0 

NA 

NA 

10867 

252117 

Upstream 

TEAD4 

7004 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

22932 

1 1 6405 

Upstream 

UPK3B 

80761 

1 

1 

chr7:76094545-76094551 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

64750 

141311 

Upstream 

LRRC37A3 

374819 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51506 

5243 

Downstream 

USP21 

27005 

1 

0 

chrl  :1 59390727-1 59390733 

5433 

26409 

■mssBi 

MGC4268 

83607 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

1482 

85434 

Upstream 

BNIP1 

662 

0 

NA 

NA 

55638 

103589 

EBAG9 

9166 

0 

NA 

NA 

NA 

NA 

r  na 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

84105 

43915 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

10743 

71497 

Downstream 

PEMT 

10400 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

836 

34375 

Downstream 

CCDC111 

201973 

0 

NA 

NA 

26683 

57173 

Upstream 

OR4F16 

81399 

1 

0.082 

chrl  :553920-553926 

84105 

43921 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

10651 

81299 

Upstream 

HOXD1 

3231 

0 

NA 

NA 

431707 

404006 

gwMiraillWO: 

TYW3 

127253 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

8665 

23987 

Downstream 

NALP10 

338322 

1 

0.063 

chrl  1 :7965392-7965398 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

2863 

288737 

Downstream 

FLJ41821 

401011 

1 

0 

chr2:1 327431 93-1 327431 99 

50509 

4097 

Downstream 

RDH8 

50700 

1 

0 

chrl  9:998061 0-998061 6 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

3209 

21660 

Downstream 

EVX1 

2128 

0 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2 ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

81849 

248032 

MSsSIBBl 

PIGK 

10026 

0 

NA 

NA 

79830 

71865 

Upstream 

SFPQ 

6421 

1 

0 

chrl  :35358576-35358582 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

4437 

3423 

DHFR 

1719 

1 

1 

chr5:79982702-79982708 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

26683 

53390 

BUBUnKl 

OR4F16 

81399 

0 

NA 

NA 

5858 

168096 

Downstream 

A2M 

2 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

5360 

19462 

Upstream 

PPGB 

5476 

0 

NA 

NA 

26683 

53392 

HKESIBBi 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51463 

25425 

GJA8 

2703 

0 

NA 

NA 

6007 

19224 

Downstream 

C1orf63 

57035 

1 

0.001 

chrl  :25465868-25465874 

5324 

989 

Downstream 

CHCHD7 

79145 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

9953 

60639 

Downstream 

CDRT15 

146822 

1 

0 

chrl  7:1 41 41 181-14141187 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7582 

616992 

Downstream 

BMS1L 

9790 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

401152 

21964 

Upstream 

FABP2 

2169 

0 

NA 

NA 

26683 

53390 

HimSKI 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

23431 

160546 

■HE 

TNFAIP8L3 

388121 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

54178 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

83987 

54668 

SWSEiHl 

FLJ 10781 

55228 

0 

NA 

NA 

5605 

41862 

Downstream 

ZBTB7A 

51341 

1 

0 

chrl  9:4059570-4059576 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

57085 

44083 

Upstream 

C1orf187 

374946 

2 

0.001 

chrl  :1 1718715-1 1718721 

11273 

1222 

MwMiranil 

ATXN2L 

11273 

0 

NA 

NA 

1 64022 

861796 

Downstream 

NBPF20 

400818 

0 

NA 

NA 

284948 

27152 

Downstream 

CAPG 

822 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

134266 

13035 

Downstream 

PCYOX1 L 

78991 

1 

0 

chr5:1 48704707-1 4870471 3 

84105 

41807 

Downstream 

CATSPER3 

347732 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

60529 

215969 

Upstream 

EXT2 

2132 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

54928 

697191 

Downstream 

PENK 

5179 

1 

0.925 

chr8:5821 7978-5821 7984 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

6474 

2200 

Downstream 

SHOX2 

6474 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

79738 

290358 

Downstream 

NAP1L1 

4673 

0 

NA 

NA 

27089 

1268 

Downstream 

GDF9 

2661 

1 

0 

chr5:1 32229470-1 32229476 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7767 

19051 

Downstream 

ZNF225 

7768 

0 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

51621 

201081 

Upstream 

OTUD7A 

161725 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

60481 

186217 

Upstream 

GCLC 

2729 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

80256 

12839 

Downstream 

STOML2 

30968 

1 

1 

chr9:351 06373-351 06379 

57573 

30485 

Downstream 

ZFP28 

140612 

1 

0.998 

chrl  9:61 71 2022-61 71 2028 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

28974 

10309 

■n 

MGC3207 

84245 

1 

0 

chrl  9:1 3746443-1 3746449 

4610 

648 

Upstream 

MYCL1 

4610 

1 

0.006 

chrl  :401 39488-401 39494 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

7091 

1421580 

HwMirailll 

PS  ATI 

29968 

3 

1 

chr9:81 52361 7-81 523623 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

50862 

51954 

HlWMirailll, 

AMPD3 

272 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

342096 

30194 

Downstream 

ISLR2 

57611 

1 

0.023 

chrl  5:721 79669-721 79675 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

50862 

51954 

Upstream 

AMPD3 

272 

0 

NA 

NA 

3141 

25584 

Downstream 

DSCR6 

53820 

0 

NA 

NA 

4211 

821714 

Downstream 

ETAA1 

54465 

0 

NA 

NA 

3017 

252 

HIST1H2BD 

3017 

2 

0.001 

chr6:262671 31 -262671 37 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

Upstream 

OR4F16 

81399 

0 

NA 

NA 

79412 

5503 

Downstream 

PAQR4 

124222 

2 

0 

chrl  6:2953823-2953829 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

79692 

143747 

GUSBL1 

387036 

0 

NA 

NA 

7673 

23755 

Upstream 

ZNF230 

7773 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

25897 

6613 

Downstream 

RNF19 

25897 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

55636 

348019 

Downstream 

MGC34646 

157807 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

50862 

51954 

Upstream 

AMPD3 

272 

0 

NA 

NA 

2690 

343375 

ISHBl 

SEPP1 

6414 

1 

0 

chr5:42504066-42504072 

79692 

143740 

Upstream 

GUSBL1 

387036 

0 

NA 

NA 

1031 

1261 

Downstream 

CDKN2C 

1031 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

126961 

534 

Downstream 

HIST2H3A 

333932 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

26683 

53390 

■9SH 

OR4F16 

81399 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

Second  Best  Hit 


Genel  ID 

Gene2 

Distance 

Gene2  Direction 

Gene2  Symbol 

Gene2  ID 

Total  E- 
Boxes 

Best  E- 
Box 

Best  Location 

8370 

0 

Overlapping 

H4/o 

554313 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

2619 

200730 

Downstream 

FLJ45537 

401535 

1 

0.003 

chr9:887531 19-88753125 

128408 

53504 

C20orf59 

63910 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

5360 

19454 

■BH 

PPGB 

5476 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

50862 

51954 

■m 

AMPD3 

272 

0 

NA 

NA 

388692 

77361 

Downstream 

NBPF1 

55672 

0 

NA 

NA 

401262 

29766 

SLC22A7 

10864 

0 

NA 

NA 

548596 

40583 

Downstream 

CATSPER2 

117155 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

25822 

41810 

Downstream 

C9orf131 

138724 

0 

NA 

NA 

4208 

391986 

Downstream 

TMEM161 B 

153396 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

22907 

21473 

DHX30 

22907 

1 

0 

chr3:47841 157-47841 163 

3167 

4429 

Downstream 

BUB3 

9184 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

80144 

319006 

Upstream 

MRPL1 

65008 

0 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

NA 

Supplementary  Table  2:  Primer  sequences  for  PCR  reactions 


Supplementary  Table  3:  Primer  sequences  for  Q-PCR  reactions 


Cyclin  D2  forward 

C  CTT  G  ACT  C  AAG  G  ATG  CGTT AG  A 

Cyclin  D2  reverse 

GAGCCGACT  GCGGT  GAAGT 

Chr  21  EBox  forward 

AAAACGCAGCCACTT  GGAGT 

Chr  21  EBox  reverse 

GCAGAAGTGCGGTTCATCAG 

GDAP2  forward 

GCAGAAAT  GCAAGAGAGAAAAGCT 

GDAP2  reverse 

CT  GACT  CCTCCTT  GTTT  GOT  CA 

RPL34  (1)  forward 

TCAAGAGTTGGGATTTAAACCCA 

RPL34  (1)  reverse 

TTCGTTGTCATGGACACTGCA 

RPL34  (2)  forward 

TACACACGTGCGCTTAATGTAAAATT 

RPL34  (2)  reverse 

AACTACTT  GAGGCTTTT  CAT  GGGT 

PHLPP  froward 

G  CG  G  G  ATG  G  AAC  ACT  GAGA 

PHLPP  reverse 

GGATTATGTGGCGATTCGGA 

TCERG1  forward 

CTTGCCCAGTGGAAATGCTC 

TCERG1  reverse 

T  CT  GTCGGCTT  CCAGGAGAG 

PKIB  forward 

GCATGTGGAACCCTTGCATAT 

PKIB  reverse 

GCATTCCCTGCAGATGTGC 

CDC14A  (1)  forward 

CCTCGGTCAGGTCCTCCTCT 

CDC14A  (1)  reverse 

TTCTTGCAGAAACCTCATCCCT 

CDC14A  (2)  forward 

T  CT  GGGAGGGAT  GAGGTTT  CT 

CDC14A  (2)  reverse 

TGGGTGGTAACATACGAAGGATC 

OGGI  forward 

ATG  G  C  ACT  GGTCTGATGTTGATT 

OGGI  reverse 

TGCAGCGGATTGACCCTT 

CDCA2  forward 

GCTTACCCTGCAGCACAGAATTA 

CDCA2  reverse 

CACCCAGAGCAGCT  CCT  GAC 

NOL6  forward 

ACTCCAGGCCCAAGTCCAG 

NOL6  reverse 

GCGAGTAATCGACTGATAATCCAA 

TRH  forward 

GCGCTT  CT  CT  CAAT  CT  CT  CT  GT  C 

TRH  reverse 

T  CT  GCACCACCT  CT  GGAGAT  CT 

KLC4  forward 

TT  GC  AGGT  GAGT  CTTT  G  AGGGTAT 

KLC4  reverse 

CATCTTTCTGGTAGTGCGAGAATAAG 

GAS1  forward 

GCAGCTCACATTCAGGCATTC 

GAS1  reverse 

TCCGAGCCTCCTTAAGAGTAGGA 

PLTP  forward 

GCAGGTCCGGAATGGTGAT 

PLTP  reverse 

GGTTT  CT  CT  GCTGCTT  CAGT  GAA 

Trrap 


Figure  1  A:  Number  of  Overlapping  Spots  at  P  <  0.05 
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Figure  IB:  TRRAP  binds  to  Myc-specific  target  genes.  Q-PCR  analysis  of  TRRAP 
specific  ChIP  using  PCR  primers  to  established  Myc  negative  and  positive  control  target  genes 
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Figure  2A:  Schematic  diagram  of  TRRAP  (3830  amino  acids)  highlighting  the  regions  we 
have  shown  that  do  (+)  and  do  not  (-)  interact  with  Myc  using  a  variety  of  technologies  to 
evaluate  interaction  both  in  vitro  (e.g.  GST-pulldown,  circular  dichroism)  and  in  vivo 
(e.g.  repressed  transactivator  assay,  co-immunoprecipitation).  From  this  analysis  we 
identified  Fragment  F8  as  a  minimal  Myc  Binding  Domain  (MBD). 


Figure:  TRRAP  Fragment  MBD2  interacts  with  c-Myc  in  mammalian  cells 
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Figure  2B:  Evidence  that  a  second  region  of  TRRAP  has  been  identified  as  a  Myc  Binding  Domain  (MBD2). 
Ectopic  expression  of  Flag-tagged  MBD2  fragment  in  cells  is  evident  in  chromatin-enriched  fraction  (A) 
Immunoprecipitation  of  Myc  using  antibody  directed  to  the  N-terminal  domain  (N262)  results  in  co- 
immunoprecipitation  of  both  Myc,  as  detected  by  western  blot  (WB)  using  antibody  9E10,  and  Flag-MBD2,  as 
detected  by  western  blot  using  anti-Flag  M2. 


